Abstract
Deep reinforcement Learning (DRL) offers a powerful framework for training AI agents to coordinate with human partners. However, DRL faces two critical challenges in human-AI coordination (HAIC): sparse rewards and unpredictable human behaviors. These challenges significantly limit DRL to identify effective coordination policies, due to its impaired capability of optimizing exploration and exploitation. To address these limitations, we propose an innovative behavior- and context-aware reward (BCR) for DRL, which optimizes exploration and exploitation by leveraging human behaviors and contextual information in HAIC. Our BCR consists of two components: (i) A novel dual intrinsic rewarding scheme to enhance exploration. This scheme composes an AI self-motivated intrinsic reward and a human-motivated intrinsic reward, which are designed to increase the capture of sparse rewards by a logarithmic-based strategy; and (ii) A new context-aware weighting mechanism for the designed rewards to improve exploitation. This mechanism helps the AI agent prioritize actions that better coordinate with the human partner by utilizing contextual information that can reflect the evolution of learning. Extensive simulations in the Overcooked environment demonstrate that our approach can increase the cumulative sparse rewards by approximately 20%, and improve the sample efficiency by around 38% compared to state-of-the-art baselines.
| Original language | English |
|---|---|
| Title of host publication | ECAI 2025 - 28th European Conference on Artificial Intelligence, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025 - Proceedings |
| Editors | Ines Lynce, Nello Murano, Mauro Vallati, Serena Villata, Federico Chesani, Michela Milano, Andrea Omicini, Mehdi Dastani |
| Place of Publication | Amsterdam Netherlands |
| Publisher | IOS Press |
| Pages | 2065-2073 |
| Number of pages | 9 |
| ISBN (Electronic) | 9781643686318 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | European Conference on Artificial Intelligence 2025 - Bologna, Italy Duration: 25 Oct 2025 → 30 Oct 2025 Conference number: 28th https://ecai2025.org/ (Website) https://ebooks.iospress.nl/volume/ecai-2025-28th-european-conference-on-artificial-intelligence-bologna-including-14th-conference-on-prestigious-applications-of-intelligent-systems-pais-2025 (Proceedings) |
Publication series
| Name | Frontiers in Artificial Intelligence and Applications |
|---|---|
| Publisher | IOS Press |
| Volume | 413 |
| ISSN (Print) | 0922-6389 |
| ISSN (Electronic) | 1879-8314 |
Conference
| Conference | European Conference on Artificial Intelligence 2025 |
|---|---|
| Abbreviated title | ECAI 2025 |
| Country/Territory | Italy |
| City | Bologna |
| Period | 25/10/25 → 30/10/25 |
| Internet address |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver