This paper presents the development of a novel personal concept-based multilingual Web content mining system. Multilingual linguistic knowledge required by multilingual Web content mining is made available by encoding all multilingual concept-term relationships within a multilingual concept space using self-organising map. With this linguistic knowledge base, a personal space of interest is generated to reveal the conceptual content of a user's multiple topics of interest using the user's bookmark file. To personalise the multilingual Web content mining process, a concept-based Web crawler is developed to automatically gather multilingual web documents that are relevant to the user's topics of interest As such, user-oriented concept-focused knowledge discovery in the multilingual Web is facilitated.
|Number of pages||10|
|Journal||Lecture Notes in Computer Science|
|Publication status||Published - 26 Sep 2005|
|Event||International Conference on Computational Science and Its Applications - ICCSA 2005 - Singapore, Singapore|
Duration: 9 May 2005 → 12 May 2005