TY - JOUR
T1 - Constrained multiagent Markov decision processes
T2 - a taxonomy of problems and algorithms
AU - de Nijs, Frits
AU - Walraven, Erwin
AU - de Weerdt, Mathijs M.
AU - Spaan, Matthijs T.J.
N1 - Funding Information:
Authors Frits de Nijs and Erwin Walraven contributed equally to this article. This work is funded by distribution system operator Alliander, and by the Netherlands Organisation for Scientific Research (NWO), as part of the Uncertainty Reduction in Smart Energy Systems program.
Publisher Copyright:
© 2021 AI Access Foundation. All rights reserved.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2021/3/8
Y1 - 2021/3/8
N2 - In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.
AB - In domains such as electric vehicle charging, smart distribution grids and autonomous warehouses, multiple agents share the same resources. When planning the use of these resources, agents need to deal with the uncertainty in these domains. Although several models and algorithms for such constrained multiagent planning problems under uncertainty have been proposed in the literature, it remains unclear when which algorithm can be applied. In this survey we conceptualize these domains and establish a generic problem class based on Markov decision processes. We identify and compare the conditions under which algorithms from the planning literature for problems in this class can be applied: whether constraints are soft or hard, whether agents are continuously connected, whether the domain is fully observable, whether a constraint is momentarily (instantaneous) or on a budget, and whether the constraint is on a single resource or on multiple. Further we discuss the advantages and disadvantages of these algorithms. We conclude by identifying open problems that are directly related to the conceptualized domains, as well as in adjacent research areas.
KW - Markov decision processes
KW - Multiagent Systems
UR - http://www.scopus.com/inward/record.url?scp=85103679548&partnerID=8YFLogxK
U2 - 10.1613/JAIR.1.12233
DO - 10.1613/JAIR.1.12233
M3 - Article
AN - SCOPUS:85103679548
VL - 70
SP - 955
EP - 1001
JO - Journal of Artificial Intelligence Research
JF - Journal of Artificial Intelligence Research
SN - 1076-9757
ER -