Abstract
Neural network-based methods represent the state of the art in question generation from text. Existing work focuses on generating questions only, without addressing answer generation. Moreover, our analysis shows that handling rare words and generating the most appropriate question for a given candidate answer remain challenges for existing approaches. We present a novel two-stage process for generating question-answer pairs from text. In the first stage, we present alternatives for encoding the span of the pivotal answer in the sentence using Pointer Networks. In the second stage, we employ sequence-to-sequence models for question generation, enhanced with rich linguistic features. Finally, global attention and answer encoding are used to generate the question most relevant to the answer. We motivate and linguistically analyze the role of each component in our framework and consider compositions of these components. This analysis is supported by extensive experimental evaluations. On standard evaluation metrics as well as in human evaluations, our experimental results show a significant improvement in the quality of the questions generated by our framework over the state of the art. The technique presented here represents another step towards more automated reading comprehension assessment. We also present a live system to demonstrate the effectiveness of our approach; a demo is available at https://www.cse.iitb.ac.in/~vishwajeet/autoqg.html.
| Original language | English |
|---|---|
| Title of host publication | Advances in Knowledge Discovery and Data Mining |
| Subtitle of host publication | 22nd Pacific-Asia Conference, PAKDD 2018, Melbourne, VIC, Australia, June 3–6, 2018, Proceedings, Part III |
| Editors | Dinh Phung, Vincent S. Tseng, Geoffrey I. Webb, Bao Ho, Mohadeseh Ganji, Lida Rashidi |
| Place of Publication | Cham, Switzerland |
| Publisher | Springer |
| Pages | 335-348 |
| Number of pages | 14 |
| ISBN (Electronic) | 9783319930404 |
| ISBN (Print) | 9783319930398 |
| Publication status | Published - 2018 |
| Event | Pacific-Asia Conference on Knowledge Discovery and Data Mining 2018, Grand Hyatt, Melbourne, Australia; Duration: 3 Jun 2018 → 6 Jun 2018; Conference number: 22nd; http://pakdd2018.medmeeting.org/Content/92892; https://link.springer.com/book/10.1007/978-3-319-93034-3 (Proceedings) |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Publisher | Springer |
| Volume | 10939 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | Pacific-Asia Conference on Knowledge Discovery and Data Mining 2018 |
|---|---|
| Abbreviated title | PAKDD 2018 |
| Country/Territory | Australia |
| City | Melbourne |
| Period | 3 Jun 2018 → 6 Jun 2018 |
Keywords
- Pointer network
- Question generation
- Sequence to sequence modeling