Document context neural machine translation with memory networks

    Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

    18 Citations (Scopus)

    Abstract

    We present a document-level neural machine translation model which takes both
    source and target document context into account using memory networks. We
    model the problem as a structured prediction problem with interdependencies
    among the observed and hidden variables, i.e., the source sentences and their unobserved target translations in the document. The resulting structured prediction problem is tackled with a neural translation model equipped with two memory components, one each for the source and target side, to capture the documental interdependencies. We train the model end-to-end, and propose an iterative decoding algorithm based on block coordinate descent. Experimental results of English translations from French, German, and Estonian documents show that our model is effective in exploiting both source and target document context, and statistically significantly outperforms the previous work in terms of BLEU and METEOR.
    Original languageEnglish
    Title of host publicationACL 2018 - The 56th Annual Meeting of the Association for Computational Linguistics
    Subtitle of host publicationProceedings of the Conference, Vol. 1 (Long Papers)
    EditorsIryna Gurevych, Yusuke Miyao
    Place of PublicationStroudsburg PA USA
    PublisherAssociation for Computational Linguistics (ACL)
    Pages1275-1284
    Number of pages10
    ISBN (Print)9781948087322
    Publication statusPublished - 2018
    EventAnnual Meeting of the Association of Computational Linguistics 2018 - Melbourne, Australia
    Duration: 15 Jul 201820 Jul 2018
    Conference number: 56th
    https://aclanthology.info/events/acl-2018

    Conference

    ConferenceAnnual Meeting of the Association of Computational Linguistics 2018
    Abbreviated titleACL 2018
    CountryAustralia
    CityMelbourne
    Period15/07/1820/07/18
    Internet address

    Cite this

    Maruf, S., & Haffari, G. (2018). Document context neural machine translation with memory networks. In I. Gurevych, & Y. Miyao (Eds.), ACL 2018 - The 56th Annual Meeting of the Association for Computational Linguistics: Proceedings of the Conference, Vol. 1 (Long Papers) (pp. 1275-1284). Association for Computational Linguistics (ACL). http://aclweb.org/anthology/P18-1118