Abstract
Complex question-answering (CQA) involves answering complex natural-language questions on a knowledge base (KB). However, the conventional neural program induction (NPI) approach exhibits uneven performance when the questions have different types, harboring inherently different characteristics, e.g., difficulty level. This paper proposes a meta-reinforcement learning approach to program induction in CQA to tackle the potential distributional bias in questions. Our method quickly and effectively adapts the meta-learned programmer to new questions based on the most similar questions retrieved from the training data. The meta-learned policy is then used to learn a good programming policy, utilizing the trial trajectories and their rewards for similar questions in the support set. Our method achieves state-of-the-art performance on the CQA dataset (Saha et al., 2018) while using only five trial trajectories for the top-5 retrieved questions in each support set, and meta-training on tasks constructed from only 1% of the training set. We have released our code at https://github.com/DevinJake/MRL-CQA.
Original language | English |
---|---|
Title of host publication | EMNLP 2020, 2020 Conference on Empirical Methods in Natural Language Processing |
Subtitle of host publication | Proceedings of the Conference |
Editors | Trevor Cohn, Yulan He, Yang Liu |
Place of Publication | Stroudsburg PA USA |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 5827–5837 |
Number of pages | 11 |
ISBN (Electronic) | 9781952148606 |
DOIs | |
Publication status | Published - 2020 |
Event | Empirical Methods in Natural Language Processing 2020 - Virtual, Punta Cana, Dominican Republic Duration: 16 Nov 2020 → 20 Nov 2020 https://2020.emnlp.org/ (Website) http://www.aclweb.org/anthology/volumes/2020.emnlp-main/ (Proceedings) https://aclanthology.org/volumes/2020.findings-emnlp/ |
Conference
Conference | Empirical Methods in Natural Language Processing 2020 |
---|---|
Abbreviated title | EMNLP 2020 |
Country/Territory | Dominican Republic |
City | Punta Cana |
Period | 16/11/20 → 20/11/20 |
Internet address |