Skip to main navigation Skip to search Skip to main content

BCR-DRL: Behavior- and Context-Aware Reward for Deep Reinforcement Learning in Human-AI Coordination

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Deep reinforcement Learning (DRL) offers a powerful framework for training AI agents to coordinate with human partners. However, DRL faces two critical challenges in human-AI coordination (HAIC): sparse rewards and unpredictable human behaviors. These challenges significantly limit DRL to identify effective coordination policies, due to its impaired capability of optimizing exploration and exploitation. To address these limitations, we propose an innovative behavior- and context-aware reward (BCR) for DRL, which optimizes exploration and exploitation by leveraging human behaviors and contextual information in HAIC. Our BCR consists of two components: (i) A novel dual intrinsic rewarding scheme to enhance exploration. This scheme composes an AI self-motivated intrinsic reward and a human-motivated intrinsic reward, which are designed to increase the capture of sparse rewards by a logarithmic-based strategy; and (ii) A new context-aware weighting mechanism for the designed rewards to improve exploitation. This mechanism helps the AI agent prioritize actions that better coordinate with the human partner by utilizing contextual information that can reflect the evolution of learning. Extensive simulations in the Overcooked environment demonstrate that our approach can increase the cumulative sparse rewards by approximately 20%, and improve the sample efficiency by around 38% compared to state-of-the-art baselines.

Original languageEnglish
Title of host publicationECAI 2025 - 28th European Conference on Artificial Intelligence, including 14th Conference on Prestigious Applications of Intelligent Systems, PAIS 2025 - Proceedings
EditorsInes Lynce, Nello Murano, Mauro Vallati, Serena Villata, Federico Chesani, Michela Milano, Andrea Omicini, Mehdi Dastani
Place of PublicationAmsterdam Netherlands
PublisherIOS Press
Pages2065-2073
Number of pages9
ISBN (Electronic)9781643686318
DOIs
Publication statusPublished - 2025
EventEuropean Conference on Artificial Intelligence 2025 - Bologna, Italy
Duration: 25 Oct 202530 Oct 2025
Conference number: 28th
https://ecai2025.org/ (Website)
https://ebooks.iospress.nl/volume/ecai-2025-28th-european-conference-on-artificial-intelligence-bologna-including-14th-conference-on-prestigious-applications-of-intelligent-systems-pais-2025 (Proceedings)

Publication series

NameFrontiers in Artificial Intelligence and Applications
PublisherIOS Press
Volume413
ISSN (Print)0922-6389
ISSN (Electronic)1879-8314

Conference

ConferenceEuropean Conference on Artificial Intelligence 2025
Abbreviated titleECAI 2025
Country/TerritoryItaly
CityBologna
Period25/10/2530/10/25
Internet address

Cite this