TY - JOUR
T1 - Towards designing a generic and comprehensive deep reinforcement learning framework
AU - Nguyen, Ngoc Duy
AU - Nguyen, Thanh Thi
AU - Pham, Nhat Truong
AU - Nguyen, Hai
AU - Nguyen, Dang Tu
AU - Nguyen, Thanh Dang
AU - Lim, Chee Peng
AU - Johnstone, Michael
AU - Bhatti, Asim
AU - Creighton, Douglas
AU - Nahavandi, Saeid
N1 - Funding Information:
The authors wish to thank our colleagues in the Institute for Intelligent Systems Research and Innovation for their comments and helpful discussion. We truly appreciate Nguyen Chau, a principal IT product manager at Atlassian, who shared his expertise in the field to eradicate potential misunderstanding in this paper. We also thank Dr. Thanh Nguyen, University of Chicago, for being an active adviser in the design process. Finally, we are grateful to the RL community in providing crucial feedback during the project beta testing phase.
Publisher Copyright:
© 2022, The Author(s).
PY - 2023/2
Y1 - 2023/2
N2 - Reinforcement learning (RL) has emerged as an effective approach for building an intelligent system, which involves multiple self-operated agents to collectively accomplish a designated task. More importantly, there has been a renewed focus on RL since the introduction of deep learning that essentially makes RL feasible to operate in high-dimensional environments. However, there are many diversified research directions in the current literature, such as multi-agent and multi-objective learning, and human-machine interactions. Therefore, in this paper, we propose a comprehensive software architecture that not only plays a vital role in designing a connect-the-dots deep RL architecture but also provides a guideline to develop a realistic RL application in a short time span. By inheriting the proposed architecture, software managers can foresee any challenges when designing a deep RL-based system. As a result, they can expedite the design process and actively control every stage of software development, which is especially critical in agile development environments. For this reason, we design a deep RL-based framework that strictly ensures flexibility, robustness, and scalability. To enforce generalization, the proposed architecture also does not depend on a specific RL algorithm, a network configuration, the number of agents, or the type of agents.
AB - Reinforcement learning (RL) has emerged as an effective approach for building an intelligent system, which involves multiple self-operated agents to collectively accomplish a designated task. More importantly, there has been a renewed focus on RL since the introduction of deep learning that essentially makes RL feasible to operate in high-dimensional environments. However, there are many diversified research directions in the current literature, such as multi-agent and multi-objective learning, and human-machine interactions. Therefore, in this paper, we propose a comprehensive software architecture that not only plays a vital role in designing a connect-the-dots deep RL architecture but also provides a guideline to develop a realistic RL application in a short time span. By inheriting the proposed architecture, software managers can foresee any challenges when designing a deep RL-based system. As a result, they can expedite the design process and actively control every stage of software development, which is especially critical in agile development environments. For this reason, we design a deep RL-based framework that strictly ensures flexibility, robustness, and scalability. To enforce generalization, the proposed architecture also does not depend on a specific RL algorithm, a network configuration, the number of agents, or the type of agents.
KW - Deep learning
KW - Human-machine interactions
KW - Learning systems
KW - Multi-agent systems
KW - Reinforcement learning
KW - Software architecture
UR - http://www.scopus.com/inward/record.url?scp=85130956613&partnerID=8YFLogxK
U2 - 10.1007/s10489-022-03550-z
DO - 10.1007/s10489-022-03550-z
M3 - Article
AN - SCOPUS:85130956613
SN - 0924-669X
VL - 53
SP - 2967
EP - 2988
JO - Applied Intelligence
JF - Applied Intelligence
IS - 3
ER -