Scalable Variational Inference for Extracting Hierarchical Phrase-based Translation Rules

Baskaran Sankaran, Gholamreza Haffari, Anoop Sarkar

    Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

    1 Citation (Scopus)

    Abstract

    We present a Variational-Bayes model for learning rules for the Hierarchical phrase-based model directly from the phrasal alignments. Our model is an alternative to heuristic rule extraction in hierarchical phrase-based translation (Chiang, 2007), which uniformly distributes the probability mass to the extracted rules locally. In contrast, in our approach the probability assigned to a rule is globally determined by its contribution towards all phrase pairs and results in a sparser rule set. We also propose a distributed framework for efficiently running inference for realistic MT corpora. Our experiments translating Korean, Arabic and Chinese into English demonstrate that they are able to exceed or retain the performance of baseline hierarchical phrase-based models.

    Original languageEnglish
    Title of host publicationSixth International Joint Conference on Natural Language Processing (ICNLP 2013), Proceedings of the Main Conference
    EditorsRuslan Mitkov, Jong C. Park
    Place of PublicationStroudsburg PA USA
    PublisherAssociation for Computational Linguistics (ACL)
    Pages438-446
    Number of pages9
    ISBN (Electronic)9784990734800
    ISBN (Print)9784990734800
    Publication statusPublished - 2013
    EventInternational Joint Conference on Natural Language Processing 2013 - Nagoya, Japan
    Duration: 14 Oct 201318 Oct 2013
    Conference number: 6th

    Conference

    ConferenceInternational Joint Conference on Natural Language Processing 2013
    Abbreviated titleIJCNLP 2013
    Country/TerritoryJapan
    CityNagoya
    Period14/10/1318/10/13

    Cite this