Context-dependent multilingual lexical lookup for under-resourced languages

Lian Tze Lim, Lay Ki Soon, Tek Yong Lim, Enya Kong Tang, Bali Ranaivo-Malançon

Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › Research › peer-review

2 Citations (Scopus)

Abstract

Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpus of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on an English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and a mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text.
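For readers unfamiliar with the two evaluation measures named in the abstract, the following is a minimal sketch of how precision and mean reciprocal rank are commonly computed over ranked translation candidates. It is not the authors' evaluation code. It assumes that precision is measured on the top-ranked candidate only, which the abstract does not state, and the function names and toy data are hypothetical.

```python
from typing import Hashable, Sequence


def reciprocal_rank(ranked: Sequence[Hashable], gold: Hashable) -> float:
    """Return 1/rank of the first occurrence of gold, or 0.0 if it is absent."""
    for rank, candidate in enumerate(ranked, start=1):
        if candidate == gold:
            return 1.0 / rank
    return 0.0


def mean_reciprocal_rank(
    ranked_lists: Sequence[Sequence[Hashable]], golds: Sequence[Hashable]
) -> float:
    """Average the reciprocal rank over all test items."""
    if not golds:
        raise ValueError("at least one test item is required")
    total = sum(reciprocal_rank(r, g) for r, g in zip(ranked_lists, golds))
    return total / len(golds)


def precision_at_1(
    ranked_lists: Sequence[Sequence[Hashable]], golds: Sequence[Hashable]
) -> float:
    """Fraction of items whose top-ranked candidate equals the gold answer.

    Assumption: precision is taken at rank 1; the source does not say so.
    """
    if not golds:
        raise ValueError("at least one test item is required")
    hits = sum(
        1 for r, g in zip(ranked_lists, golds) if len(r) > 0 and r[0] == g
    )
    return hits / len(golds)


# Toy data (hypothetical): three test words, each with a ranked list of
# candidate translations and one gold translation.
toy_rankings = [["a", "b", "c"], ["b", "a"], ["c", "b", "a"]]
toy_golds = ["a", "a", "a"]

toy_mrr = mean_reciprocal_rank(toy_rankings, toy_golds)
toy_p1 = precision_at_1(toy_rankings, toy_golds)
```

Under this reading, mean reciprocal rank gives partial credit when the correct translation appears below the first position, which is one reason it can be higher than precision for the same system.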

Original language: English
Title of host publication: Short Papers
Publisher: Association for Computational Linguistics (ACL)
Pages: 294-299
Number of pages: 6
ISBN (Print): 9781937284510
Publication status: Published - 2013
Externally published: Yes
Event: Annual Meeting of the Association of Computational Linguistics 2013 - Sofia, Bulgaria
Duration: 4 Aug 2013 to 9 Aug 2013
Conference number: 51st
http://www.acl2013.org/site/
https://www.aclweb.org/anthology/events/acl-2013/ (Proceedings)

Conference

Conference: Annual Meeting of the Association of Computational Linguistics 2013
Abbreviated title: ACL 2013
Country/Territory: Bulgaria
City: Sofia
Period: 4/08/13 to 9/08/13