Vol. 2 No. 2 (2022): Journal of AI-Assisted Scientific Discovery
Articles

A reinforcement learning approach for training complex decision making models

Sarbaree Mishra
Program Manager at Molina Healthcare Inc., USA
Cover

Published 11-07-2022

Keywords

  • Reinforcement learning,
  • decision-making

How to Cite

[1]
Sarbaree Mishra, “A reinforcement learning approach for training complex decision making models”, Journal of AI-Assisted Scientific Discovery, vol. 2, no. 2, pp. 329–352, Jul. 2022, Accessed: Dec. 30, 2024. [Online]. Available: https://scienceacadpress.com/index.php/jaasd/article/view/238

Abstract

Reinforcement learning (RL) is a powerful branch of machine learning that enables systems to learn optimal strategies through trial-and-error interactions with their environments, making it a natural fit for tackling complex decision-making problems. Unlike traditional methods that rely on predefined rules or labelled datasets, RL trains models by rewarding desired behaviours, allowing them to adapt dynamically to changing conditions. This ability to self-learn and improve has made RL increasingly crucial across industries, from robotics and gaming to finance and healthcare, where intelligent systems must make nuanced decisions in unpredictable settings. This article explores the core principles of reinforcement learning, shedding light on how agents learn by balancing exploration and exploitation. We dive into popular algorithms like Q-learning, Deep Q-Networks, and Policy Gradient methods, explaining their relevance in solving real-world challenges. Through practical examples, such as optimizing supply chain logistics or enhancing autonomous vehicle navigation, we illustrate the transformative potential of RL in training systems to handle intricate decision-making tasks. However, implementing RL in real-world scenarios is not without hurdles—issues like sample inefficiency, reward shaping, and the complexity of scaling solutions can impede progress. We provide actionable recommendations for addressing these challenges, including leveraging hybrid methods, improving environment simulation fidelity, and designing robust reward structures. Furthermore, we discuss the importance of combining RL with other techniques, such as supervised learning or evolutionary algorithms, to unlock its full potential. This discussion highlights RL's opportunities and limitations, emphasizing the need for continued innovation and collaboration between researchers and practitioners. This article is a comprehensive guide for those looking to harness reinforcement learning in building intelligent, adaptable decision-making models by bridging theoretical concepts with hands-on strategies.

Downloads

Download data is not yet available.

References

  1. Kulkarni, P. (2012). Reinforcement and systemic machine learning for decision making (Vol. 1). John Wiley & Sons.
  2. Xu, X., Zuo, L., Li, X., Qian, L., Ren, J., & Sun, Z. (2018). A reinforcement learning approach to autonomous decision making of intelligent vehicles on highways. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 50(10), 3884-3897.
  3. Shi, H., & Xu, M. (2019). A multiple-attribute decision-making approach to reinforcement learning. IEEE Transactions on Cognitive and Developmental Systems, 12(4), 695-708.
  4. Kelemen, A., Liang, Y., & Franklin, S. (2002). A comparative study of different machine learning approaches for decision making.
  5. Wu, W., Huang, Z., Zeng, J., & Fan, K. (2021). A fast decision-making method for process planning with dynamic machining resources via deep reinforcement learning. Journal of manufacturing systems, 58, 392-411.
  6. Shortreed, S. M., Laber, E., Lizotte, D. J., Stroup, T. S., Pineau, J., & Murphy, S. A. (2011). Informing sequential clinical decision-making through reinforcement learning: an empirical study. Machine learning, 84, 109-136.
  7. Loftus, T. J., Filiberto, A. C., Li, Y., Balch, J., Cook, A. C., Tighe, P. J., ... & Bihorac, A. (2020). Decision analysis and reinforcement learning in surgical decision-making. Surgery, 168(2), 253-266.
  8. He, Y., Xing, L., Chen, Y., Pedrycz, W., Wang, L., & Wu, G. (2020). A generic Markov decision process model and reinforcement learning method for scheduling agile earth observation satellites. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 52(3), 1463-1474.
  9. Rogova, G., & Kasturi, J. (2001, August). Reinforcement learning neural network for distributed decision making. In Proc. of the Forth Conf. on Information Fusion.
  10. Dayan, P., & Daw, N. D. (2008). Decision theory, reinforcement learning, and the brain. Cognitive, Affective, & Behavioral Neuroscience, 8(4), 429-453.
  11. Pednault, E., Abe, N., & Zadrozny, B. (2002, July). Sequential cost-sensitive decision making with reinforcement learning. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 259-268).
  12. Shi, H., Lin, Z., Zhang, S., Li, X., & Hwang, K. S. (2018). An adaptive decision-making method with fuzzy Bayesian reinforcement learning for robot soccer. Information Sciences, 436, 268-281.
  13. Tsoukalas, A., Albertson, T., & Tagkopoulos, I. (2015). From data to optimal decision making: a data-driven, probabilistic machine learning approach to decision support for patients with sepsis. JMIR medical informatics, 3(1), e3445.
  14. Jayatilake, S. M. D. A. C., & Ganegoda, G. U. (2021). Involvement of machine learning tools in healthcare decision making. Journal of healthcare engineering, 2021(1), 6679512.
  15. He, X., Fei, C., Liu, Y., Yang, K., & Ji, X. (2020, September). Multi-objective longitudinal decision-making for autonomous electric vehicle: a entropy-constrained reinforcement learning approach. In 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC) (pp. 1-6). IEEE.
  16. Thumburu, S. K. R. (2021). A Framework for EDI Data Governance in Supply Chain Organizations. Innovative Computer Sciences Journal, 7(1).
  17. Thumburu, S. K. R. (2021). The Future of EDI Standards in an API-Driven World. MZ Computing Journal, 2(2).
  18. Thumburu, S. K. R. (2021). Transitioning to Cloud-Based EDI: A Migration Framework, Journal of Innovative Technologies, 4(1).
  19. Gade, K. R. (2020). Data Analytics: Data Privacy, Data Ethics, Data Monetization. MZ Computing Journal, 1(1).
  20. Gade, K. R. (2021). Data Analytics: Data Democratization and Self-Service Analytics Platforms Empowering Everyone with Data. MZ Computing Journal, 2(1).
  21. Katari, A. Conflict Resolution Strategies in Financial Data Replication Systems.
  22. Katari, A., Muthsyala, A., & Allam, H. HYBRID CLOUD ARCHITECTURES FOR FINANCIAL DATA LAKES: DESIGN PATTERNS AND USE CASES.
  23. Komandla, V. Transforming Financial Interactions: Best Practices for Mobile Banking App Design and Functionality to Boost User Engagement and Satisfaction.
  24. Komandla, V. Strategic Feature Prioritization: Maximizing Value through User-Centric Roadmaps.
  25. Thumburu, S. K. R. (2020). Enhancing Data Compliance in EDI Transactions. Innovative Computer Sciences Journal, 6(1).