The best of both worlds: Combining human and machine translations for multilingual semantic parsing with active learning

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Multilingual semantic parsing aims to leverage the knowledge from the high-resource languages to improve low-resource semantic parsing, yet commonly suffers from the data imbalance problem. Prior works propose to utilize the translations by either humans or machines to alleviate such issues. However, human translations are expensive, while machine translations are cheap but prone to error and bias. In this work, we propose an active learning approach that exploits the strengths of both human and machine translations by iteratively adding small batches of human translations into the machine-translated training set. Besides, we propose novel aggregated acquisition criteria that help our active learning method select utterances to be manually translated. Our experiments demonstrate that an ideal utterance selection can significantly reduce the error and bias in the translated data, resulting in higher parser accuracies than the parsers merely trained on the machine-translated data.
Original languageEnglish
Title of host publicationACL 2023 - The 61st Conference of the the Association for Computational Linguistics - Proceedings of the Conference Volume 1: Long Papers
EditorsAnna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Place of PublicationStroudsburg PA USA
PublisherAssociation for Computational Linguistics (ACL)
Pages9511–9528
Number of pages18
ISBN (Electronic)9781959429722
DOIs
Publication statusPublished - 2023
EventAnnual Meeting of the Association of Computational Linguistics 2023 - Toronto, Canada
Duration: 9 Jul 202314 Jul 2023
Conference number: 61st
https://aclanthology.org/volumes/2023.acl-long/ (Proceedings - 1)
https://aclanthology.org/volumes/2023.findings-acl/ (Proceedings - 2)
https://2023.aclweb.org/ (Website)

Conference

ConferenceAnnual Meeting of the Association of Computational Linguistics 2023
Abbreviated titleACL 2023
Country/TerritoryCanada
CityToronto
Period9/07/2314/07/23
Internet address

Cite this