Projects per year
Abstract
Previous works mostly focus on either multilingual or multi-domain aspects of neural machine translation (NMT). This paper investigates whether the domain information can be transferred across languages on the composition of multi-domain and multilingual NMT, particularly for the incomplete data condition where in-domain bitext is missing for some language pairs. Our results in the curated leave-one-domain-out experiments show that multi-domain multilingual (MDML) NMT can boost zero-shot translation performance up to +10 gains on BLEU, as well as aid the generalisation of multi-domain NMT to the missing domain. We also explore strategies for effective integration of multilingual and multi-domain NMT, including language and domain tag combination and auxiliary task training. We find that learning domain-aware representations and adding target-language tags to the encoder leads to effective MDML-NMT.
Original language | English |
---|---|
Title of host publication | WMT 2022 - Seventh Conference on Machine Translation - Proceedings of the Conference |
Place of Publication | Stroudsburg PA USA |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 381-396 |
Number of pages | 16 |
ISBN (Electronic) | 9781959429296 |
Publication status | Published - 2022 |
Event | Conference on Machine Translation 2022 - Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 8 Dec 2022 Conference number: 7th https://aclanthology.org/volumes/2022.wmt-1/ (Proceedings) |
Conference
Conference | Conference on Machine Translation 2022 |
---|---|
Abbreviated title | WMT 2022 |
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 7/12/22 → 8/12/22 |
Internet address |
|
Projects
- 1 Active
-
Exploiting Context in Multilingual Understanding and Generation
Australian Research Council (ARC)
20/11/20 → 25/12/25
Project: Research