Persian-Spanish low-resource statistical machine translation through english as pivot language

Benyamin Ahmadnia, Javier Serrano, Gholamreza Haffari

    Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

    8 Citations (Scopus)

    Abstract

    This paper is an attempt to exclusively focus on investigating the pivot language technique in which a bridging language is utilized to increase the quality of the Persian-Spanish low-resource Statistical Machine Translation (SMT). In this case, English is used as the bridging language, and the Persian-English SMT is combined with the English-Spanish one, where the relatively large corpora of each may be used in support of the Persian-Spanish pairing. Our results indicate that the pivot language technique outperforms the direct SMT processes currently in use between Persian and Spanish. Furthermore, we investigate the sentence translation pivot strategy and the phrase translation in turn, and demonstrate that, in the context of the Persian-Spanish SMT system, the phrase-level pivoting outperforms the sentence-level pivoting. Finally we suggest a method called combination model in which the standard direct model and the best triangulation pivoting model are blended in order to reach a high-quality translation.

    Original languageEnglish
    Title of host publicationInternational Conference on Recent Advances in Natural Language Processing 2017
    Subtitle of host publicationMeet Deep Learning, RANLP 2017 - Proceedings
    EditorsRuslan Mitkov , Galia Angelova
    Place of PublicationPA USA
    PublisherAssociation for Computational Linguistics (ACL)
    Pages24-30
    Number of pages7
    ISBN (Electronic)9789544520489
    DOIs
    Publication statusPublished - 2017
    EventInternational Conference on Recent Advances in Natural Language Processing 2017 - Varna, Bulgaria
    Duration: 2 Sep 20178 Sep 2017
    Conference number: 11th

    Conference

    ConferenceInternational Conference on Recent Advances in Natural Language Processing 2017
    Abbreviated titleRANLP 2017
    CountryBulgaria
    CityVarna
    Period2/09/178/09/17

    Cite this