Towards open ontology learning and filtering

Amal Zouaq, Dragan Gasevic, Marek Hatala

Research output: Contribution to journalArticleResearchpeer-review

58 Citations (Scopus)

Abstract

Open ontology learning is the process of extracting a domain ontology from a knowledge source in an unsupervised way. Due to its unsupervised nature, it requires filtering mechanisms to rate the importance and correctness of the extracted knowledge. This paper presents OntoCmaps, a domain-independent and open ontology learning tool that extracts deep semantic representations from corpora. OntoCmaps generates rich conceptual representations in the form of concept maps and proposes an innovative filtering mechanism based on metrics from graph theory. Our results show that using metrics such as Betweenness, PageRank, Hits and Degree centrality outperforms the results of standard text-based metrics (TF-IDF, term frequency) for concept identification. We propose voting schemes based on these metrics that provide a good performance in relationship identification, which again provides better results (in terms of precision and F-measure) than other traditional metrics such as frequency of co-occurrences. The approach is evaluated against a gold standard and is compared to the ontology learning tool Text2Onto. The OntoCmaps generated ontology is more expressive than Text2Onto ontology especially in conceptual relationships and leads to better results in terms of precision, recall and F-measure.

Original languageEnglish
Pages (from-to)1064-1081
Number of pages18
JournalInformation Systems
Volume36
Issue number7
DOIs
Publication statusPublished - 1 Nov 2011
Externally publishedYes

Keywords

  • Filtering
  • Graph theory
  • Metrics
  • Ontology learning

Cite this