Deep reinforcement learning for multiagent systems: A review of challenges, solutions, and applications

Thanh Thi Nguyen, Ngoc Duy Nguyen, Saeid Nahavandi

Research output: Contribution to journalArticleResearchpeer-review

840 Citations (Scopus)

Abstract

Reinforcement learning (RL) algorithms have been around for decades and employed to solve various sequential decision-making problems. These algorithms, however, have faced great challenges when dealing with high-dimensional environments. The recent development of deep learning has enabled RL methods to drive optimal policies for sophisticated and capable agents, which can perform efficiently in these challenging environments. This article addresses an important aspect of deep RL related to situations that require multiple agents to communicate and cooperate to solve complex tasks. A survey of different approaches to problems related to multiagent deep RL (MADRL) is presented, including nonstationarity, partial observability, continuous state and action spaces, multiagent training schemes, and multiagent transfer learning. The merits and demerits of the reviewed methods will be analyzed and discussed with their corresponding applications explored. It is envisaged that this review provides insights about various MADRL methods and can lead to the future development of more robust and highly useful multiagent learning methods for solving real-world problems.

Original languageEnglish
Pages (from-to)3826-3839
Number of pages14
JournalIEEE Transactions on Cybernetics
Volume50
Issue number9
DOIs
Publication statusPublished - Sept 2020
Externally publishedYes

Keywords

  • Continuous action space
  • deep learning
  • deep reinforcement learning (RL)
  • multiagent
  • nonstationary
  • partial observability
  • review
  • robotics
  • survey

Cite this