Abstract
Given the new challenges of open and unsupervised information extraction, there is a need to identify important and relevant knowledge structures (concepts and relationships) in the vast amount of extracted data and to filter the noise that results from unsupervised information extraction. This is generally referred to as the ontologization task. This paper uses measures from graph theory to identify these key elements such as Page Rank, Betweenness, and Degree. We also propose a combination of metrics for ranking concepts and relationships. Our approach shows effective results in terms of precision compared to other standard measures for weighting concepts and relationships such as TF*IDF or frequency of co-occurrences.
Original language | English |
---|---|
Title of host publication | 26th Annual ACM Symposium on Applied Computing, SAC 2011 |
Pages | 1687-1692 |
Number of pages | 6 |
DOIs | |
Publication status | Published - 23 Jun 2011 |
Externally published | Yes |
Event | ACM Symposium on Applied Computing 2011 - Tunghai University, Taichung, Taiwan Duration: 21 Mar 2011 → 24 Mar 2011 Conference number: 26th http://www.sigapp.org/sac/sac2011/ https://dl.acm.org/doi/proceedings/10.1145/1982185 (Proceedings) |
Conference
Conference | ACM Symposium on Applied Computing 2011 |
---|---|
Abbreviated title | SAC 2011 |
Country/Territory | Taiwan |
City | Taichung |
Period | 21/03/11 → 24/03/11 |
Internet address |
Keywords
- concept and relation importance
- graph theory
- metrics
- ontologization
- ontology
- precision