Abstract
Current approaches for word sense disambiguation and translation selection typically require lexical resources or large bilingual corpora with rich information fields and annotations, which are often infeasible for under-resourced languages. We extract translation context knowledge from a bilingual comparable corpora of a richer-resourced language pair, and inject it into a multilingual lexicon. The multilingual lexicon can then be used to perform context-dependent lexical lookup on texts of any language, including under-resourced ones. Evaluations on a prototype lookup tool, trained on a English-Malay bilingual Wikipedia corpus, show a precision score of 0.65 (baseline 0.55) and mean reciprocal rank score of 0.81 (baseline 0.771). Based on the early encouraging results, the context-dependent lexical lookup tool may be developed further into an intelligent reading aid, to help users grasp the gist of a second or foreign language text.
Original language | English |
---|---|
Title of host publication | Short Papers |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 294-299 |
Number of pages | 6 |
ISBN (Print) | 9781937284510 |
Publication status | Published - 2013 |
Externally published | Yes |
Event | Annual Meeting of the Association of Computational Linguistics 2013 - Sofia, Bulgaria Duration: 4 Aug 2013 → 9 Aug 2013 Conference number: 51st http://www.acl2013.org/site/ https://www.aclweb.org/anthology/events/acl-2013/ (Proceedings) |
Conference
Conference | Annual Meeting of the Association of Computational Linguistics 2013 |
---|---|
Abbreviated title | ACL 2013 |
Country/Territory | Bulgaria |
City | Sofia |
Period | 4/08/13 → 9/08/13 |
Internet address |