Gaussian Process Bandits with Aggregated Feedback

Mengyan Zhang, Russell Tsuchida, Cheng Soon Ong

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

3 Citations (Scopus)

Abstract

We consider the continuum-armed bandits problem, under a novel setting of recommending the best arms within a fixed budget under aggregated feedback. This is motivated by applications where the precise rewards are impossible or expensive to obtain, while an aggregated reward or feedback, such as the average over a subset, is available. We constrain the set of reward functions by assuming that they are from a Gaussian Process and propose the Gaussian Process Optimistic Optimisation (GPOO) algorithm. We adaptively construct a tree with nodes as subsets of the arm space, where the feedback is the aggregated reward of representatives of a node. We propose a new simple regret notion with respect to aggregated feedback on the recommended arms. We provide theoretical analysis for the proposed algorithm, and recover single point feedback as a special case. We illustrate GPOO and compare it with related algorithms on simulated data.

Original languageEnglish
Title of host publicationThirty-Sixth AAAI Conference on Artificial Intelligence
EditorsVasant Honavar, Matthijs Spaan
Place of PublicationPalo Alto CA USA
PublisherAssociation for the Advancement of Artificial Intelligence (AAAI)
Pages9074-9081
Number of pages8
ISBN (Electronic)1577358767, 9781577358763
DOIs
Publication statusPublished - 2022
Externally publishedYes
EventAAAI Conference on Artificial Intelligence 2022 - Online, United States of America
Duration: 22 Feb 20221 Mar 2022
Conference number: 36th
https://aaai-2022.virtualchair.net/index.html (Website)
https://aaai.org/conference/aaai/aaai-22/
https://ojs.aaai.org/index.php/AAAI/issue/view/510 (Proceedings)

Publication series

NameProceedings of the 36th AAAI Conference on Artificial Intelligence, AAAI 2022
PublisherAssociation for the Advancement of Artificial Intelligence (AAAI)
Volume36
ISSN (Print)2159-5399
ISSN (Electronic)2374-3468

Conference

ConferenceAAAI Conference on Artificial Intelligence 2022
Abbreviated titleAAAI 2022
Country/TerritoryUnited States of America
CityOnline
Period22/02/221/03/22
Internet address

Keywords

  • Machine Learning (ML)

Cite this