Graceful Task Adaptation with a Bi-hemispheric RL Agent

Grant Nicholas, Levin Kuhlmann, Gideon Kowadlo

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

In humans, responsibility for performing a task gradually shifts from the right hemisphere to the left. The Novelty-Routine Hypothesis (NRH) states that the right and left hemispheres are used to perform novel and routine tasks respectively, enabling us to learn a diverse range of novel tasks while performing the task capably. Drawing on the NRH, we develop a reinforcement learning agent with specialised hemispheres that can exploit generalist knowledge from the right-hemisphere to avoid poor initial performance on novel tasks. In addition, we find that this design has minimal impact on its ability to learn novel tasks. We conclude by identifying improvements to our agent and exploring potential expansion to the continual learning setting.

Original languageEnglish
Title of host publicationAI 2024, Advances in Artificial Intelligence, 37th Australasian Joint Conference on Artificial Intelligence, AI 2024 Melbourne, VIC, Australia, November 25–29, 2024 Proceedings, Part II
EditorsMingming Gong, Yiliao Song, Yun Sing Koh, Wei Xiang, Derui Wang
Place of PublicationSingapore Singapore
PublisherSpringer
Pages26-38
Number of pages13
ISBN (Electronic)9789819603510
ISBN (Print)9789819603503
DOIs
Publication statusPublished - 2025
EventAustralasian Joint Conference on Artificial Intelligence 2024 - Melbourne, Australia
Duration: 25 Nov 202429 Nov 2024
Conference number: 37th
https://ajcai2024.org/ (Conference website)
https://doi.org/10.1007/978-981-96-0351-0 (Conference proceedings)

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
Volume15443
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceAustralasian Joint Conference on Artificial Intelligence 2024
Abbreviated titleAJCAI 2024
Country/TerritoryAustralia
CityMelbourne
Period25/11/2429/11/24
Internet address

Cite this