Abstract
A noun compound (NC) is a sequence of two or more nouns (entities) acting as a single noun entity that encodes implicit semantic relation between its noun constituents. Given an NC such as 'headache pills' and possible paraphrases such as: 'pills that induce headache' or 'pills that relieve head-ache' can we learn to choose which verb: 'induce' or 'relieve' that best describes the semantic relation encoded in 'headache pills'? In this paper, we describe our approaches to rank human-proposed paraphrasing verbs of NCs. Our contribution is a novel approach that uses two-step process of clustering similar NCs and then labeling the best paraphrasing verb as the most prototypical verb in the cluster. The approach performs the best with an average Spearman's rank correlation of 0.55. This approach, while being computationally simpler, gives a better ranking than the current state of the art. The result shows the potential of our approach for finding implicit relations between entities especially when the relations are not explicit in the context in which the entities appear, rather they are implicit in the relationship between its constituents.
Original language | English |
---|---|
Title of host publication | CIKM 2011 Glasgow |
Subtitle of host publication | SMER'11 - Proceedings of the 1st International Workshop on Search and Mining Entity-Relationship Data |
Pages | 9-14 |
Number of pages | 6 |
DOIs | |
Publication status | Published - 2011 |
Externally published | Yes |
Event | International Workshop on Search and Mining Entity-Relationship Data 2011 - Glasgow, United Kingdom Duration: 28 Oct 2011 → 28 Oct 2011 Conference number: 1st |
Workshop
Workshop | International Workshop on Search and Mining Entity-Relationship Data 2011 |
---|---|
Abbreviated title | SMER 2011 |
Country/Territory | United Kingdom |
City | Glasgow |
Period | 28/10/11 → 28/10/11 |
Keywords
- clustering
- noun compounds
- paraphrasing
- ranking
- semantic relation