Inductive biases for low data VQA: a data augmentation approach

Narjes Askarian, Ehsan Abbasnejad, Ingrid Zukerman, Wray Buntine, Gholamreza Haffari

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

4 Citations (Scopus)

Abstract

Visual question answering (VQA) is the problem of understanding rich image contexts and answering complex natural language questions about them. VQA models have recently achieved remarkable results when training on large-scale labeled datasets. However, annotating large amounts of data is not feasible in many domains. In this paper, we address the problem of VQA in low labeled data regime, which is under-explored in the literature. We take a data augmentation approach to enlarge the initial small labeled data in order to inject proper inductive biases to the VQA model. We encode the additional inductive biases in the questions by producing new ones taking advantage of the image annotations. Our results show up to 34% accuracy improvements compared to the baselines trained on only the initial labeled data.

Original languageEnglish
Title of host publicationProceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2022
EditorsRyan Farrell, Catherine Zhao, Saket Anand, Richard Souvenir
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages231-240
Number of pages10
ISBN (Electronic)9781665458245
ISBN (Print)9781665458252
DOIs
Publication statusPublished - 2022
EventIEEE Winter Conference on Applications of Computer Vision Workshops 2022 - Waikoloa, United States of America
Duration: 4 Jan 20228 Jan 2022
https://ieeexplore.ieee.org/xpl/conhome/9707470/proceeding (Proceedings)

Publication series

NameProceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2022
PublisherIEEE, Institute of Electrical and Electronics Engineers
ISSN (Print)2690-621X
ISSN (Electronic)2690-621X

Conference

ConferenceIEEE Winter Conference on Applications of Computer Vision Workshops 2022
Abbreviated titleWACVW 2022
Country/TerritoryUnited States of America
CityWaikoloa
Period4/01/228/01/22
Internet address

Cite this