Multi-agent reinforcement learning with temporal logic specifications

Lewis Hammond, Alessandro Abate, Julian Gutierrez, Michael Wooldridge

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

22 Citations (Scopus)

Abstract

In this paper, we study the problem of learning to satisfy temporal logic specifications with a group of agents in an unknown environment, which may exhibit probabilistic behaviour. From a learning perspective these specifications provide a rich formal language with which to capture tasks or objectives, while from a logic and automated verification perspective the introduction of learning capabilities allows for practical applications in large, stochastic, unknown environments. The existing work in this area is, however, limited. Of the frameworks that consider full linear temporal logic or have correctness guarantees, all methods thus far consider only the case of a single temporal logic specification and a single agent. In order to overcome this limitation, we develop the first multi-agent reinforcement learning technique for temporal logic specifications, which is also novel in its ability to handle multiple specifications. We provide correctness and convergence guarantees for our main algorithm - Almanac (Automaton/Logic Multi-Agent Natural Actor-Critic) - even when using function approximation. Alongside our theoretical results, we further demonstrate the applicability of our technique via a set of preliminary experiments.

Original languageEnglish
Title of host publicationAAMAS'21, Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems
EditorsUlle Endriss, Ann Nowé
Place of PublicationNew York NY USA
PublisherAssociation for Computing Machinery (ACM)
Pages583-592
Number of pages10
ISBN (Electronic)9781713832621
ISBN (Print)9781450383073
DOIs
Publication statusPublished - 2021
EventInternational Conference on Autonomous Agents and Multiagent Systems 2021 - Online, United Kingdom
Duration: 3 May 20217 May 2021
Conference number: 20th
https://dl.acm.org/doi/proceedings/10.5555/3463952 (Proceedings)
https://aamas2021.soton.ac.uk (Website)

Publication series

NameProceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems, AAMAS
Volume1
ISSN (Print)1548-8403
ISSN (Electronic)1558-2914

Conference

ConferenceInternational Conference on Autonomous Agents and Multiagent Systems 2021
Abbreviated titleAAMAS 2021
Country/TerritoryUnited Kingdom
Period3/05/217/05/21
Internet address

Keywords

  • Automata
  • Formal methods
  • Multi-agent reinforcement learning
  • Multi-objective reinforcement learning
  • Temporal logic

Cite this