From raw text to morphological rules for Iban morphological analyser

Suhaila Saee, Lay Ki Soon, Tek Yong Lim, Bali Ranaivo-Malançon, Enya Kong Tang

    Research output: Chapter in Book/Report/Conference proceeding > Conference Paper > Other > peer-review

    Abstract

    To extend a complete workflow for the automatic acquisition of morphological rules for a morphological analyser, we propose a semi-automatic workflow for an under-resourced language, Iban. The workflow focuses on determining the rules to be used for building an Iban morphological analyser without prior knowledge of language-specific morphological rules. This work introduces three main steps for acquiring the rules from the under-resourced language: morphological rule extraction, validation of the extracted rules, and evaluation of the generated rules. The proposed workflow generated 25 rules from 744 candidate rules and achieved 76% precision and 99% recall. We believe the workflow will help other researchers build morphological analysers with validated morphological rules for under-resourced languages.
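
    The abstract reports precision and recall but does not say how they were computed. As a minimal sketch, assuming the standard set-based definitions (precision is the share of extracted rules that are correct, recall is the share of correct rules that were extracted), the calculation could look like the following. The function name and the rule identifiers are hypothetical placeholders, not rules or data from the paper.

```python
def precision_recall(extracted, gold):
    """Return (precision, recall) of extracted rules against a reference set.

    Assumes the standard definitions; the paper may compute them differently.
    """
    extracted, gold = set(extracted), set(gold)
    true_positives = len(extracted & gold)  # rules that are both extracted and correct
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical placeholder identifiers, not taken from the paper.
extracted_rules = {"rule_a", "rule_b", "rule_c", "rule_d"}
gold_rules = {"rule_a", "rule_b", "rule_c", "rule_e"}

precision, recall = precision_recall(extracted_rules, gold_rules)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

    In this toy case three of the four extracted rules appear in the reference set, and three of the four reference rules were extracted.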

    Original language: English
    Title of host publication: 2012 International Conference on Asian Language Processing
    Pages: 21-24
    Number of pages: 4
    Publication status: Published - 2012
    Event: International Conference on Asian Language Processing (IALP) 2012 - Hanoi, Vietnam
    Duration: 13 Nov 2012 to 15 Nov 2012
    https://ieeexplore.ieee.org/xpl/conhome/6472868/proceeding (Proceedings)

    Conference

    Conference: International Conference on Asian Language Processing (IALP) 2012
    Abbreviated title: IALP 2012
    Country/Territory: Vietnam
    City: Hanoi
    Period: 13/11/12 to 15/11/12

    Keywords

    • Morphological analyzer
    • Morphological rules
    • Rules extraction
    • Under-resourced language
