Abstract
Sequence to sequence (SEQ2SEQ) models often lack diversity in their generated translations. This can be attributed to the limitation of SEQ2SEQ models in capturing lexical and syntactic variations in a parallel corpus resulting from different styles, genres, topics, or ambiguity of the translation process. In this paper, we develop a novel sequence to sequence mixture (S2SMIX) model that improves both translation diversity and quality by adopting a committee of specialized translation models rather than a single translation model. Each mixture component selects its own training dataset via optimization of the marginal log-likelihood, which leads to a soft clustering of the parallel corpus. Experiments on four language pairs demonstrate the superiority of our mixture model compared to a SEQ2SEQ baseline with standard or diversity-boosted beam search. Our mixture model uses negligible additional parameters and incurs no extra computation cost during decoding.
| Original language | English |
|---|---|
| Title of host publication | CoNLL 2018 - The 22nd Conference on Computational Natural Language Learning - Proceedings of the Conference |
| Editors | Miikka Silfverberg |
| Place of Publication | Stroudsburg PA USA |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 583-592 |
| Number of pages | 10 |
| ISBN (Electronic) | 9781948087728 |
| Publication status | Published - 2018 |
| Event | Conference on Natural Language Learning 2018 - Brussels, Belgium Duration: 31 Oct 2018 → 1 Nov 2018 Conference number: 22nd https://www.conll.org/2018 https://www.aclweb.org/anthology/volumes/K18-1/ (Proceedings) |
Publication series
| Name | CoNLL 2018 - 22nd Conference on Computational Natural Language Learning, Proceedings |
|---|
Conference
| Conference | Conference on Natural Language Learning 2018 |
|---|---|
| Abbreviated title | CoNLL 2018 |
| Country/Territory | Belgium |
| City | Brussels |
| Period | 31/10/18 → 1/11/18 |
| Internet address |
Projects
- 1 Finished
-
Learning Deep Semantics for Automatic Translation between Human Languages
Haffari, R. (Primary Chief Investigator (PCI)), Cohn, T. (Chief Investigator (CI)) & Blunsom, P. (Partner Investigator (PI))
ARC - Australian Research Council
1/01/16 → 31/12/18
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver