Abstract
Defining the behaviour of a set of situated agents, such that a collaborative problem can be solved is a key issue in multiagent systems. In this paper, we formulate this problem from the decision theoretic perspective using the framework of Decentralized Partially Observable Markov Decision Processes (DEC-POMDP). Formulating the coordination problem in this way provides a formal foundation for study of cooperation activities. But, as it has been recently shown solving DEC-POMDP is NEXP-complete and thus it is not a realistic approach for the design of agent cooperation policies. However, we demonstrate in this paper that it is not completely desperate. Indeed, we propose an heuristic approach for solving DEC-POMDP when agents are memory-less and when the global reward function can be broken up into a sum of local reward functions. We demonstrate experimentally on an example (the so-called pursuit problem) that this heuristic is efficient within a few iteration steps.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the ACM Symposium on Applied Computing |
| Pages | 57-62 |
| Number of pages | 6 |
| Publication status | Published - 2002 |
| Externally published | Yes |
| Event | ACM Symposium on Applied Computing 2002 - Madrid, Spain Duration: 10 Mar 2002 → 14 Mar 2002 Conference number: 17th https://dl.acm.org/citation.cfm?id=508791&picked=prox (ACM Digital Library Proceedings) |
Conference
| Conference | ACM Symposium on Applied Computing 2002 |
|---|---|
| Abbreviated title | SAC 2002 |
| Country/Territory | Spain |
| City | Madrid |
| Period | 10/03/02 → 14/03/02 |
| Internet address |
|
Keywords
- Decision theoretic agents
- Multiagent systems
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver