Understanding semantic change of words over centuries

Derry Tanti Wijaya, Reyyan Yeniterzi

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

79 Citations (Scopus)

Abstract

In this paper, we propose to model and analyze changes that occur to an entity in terms of changes in the words that co-occur with the entity over time. We propose to do an in-depth analysis of how this co-occurrence changes over time, how the change influences the state (semantic, role) of the entity, and how the change may correspond to events occurring in the same period of time. We propose to identify clusters of topics surrounding the entity over time using Topics-Over-Time (TOT) and k-means clustering. We conduct this analysis on Google Books Ngram dataset. We show how clustering words that co-occur with an entity of interest in 5-grams can shed some lights to the nature of change that occurs to the entity and identify the period for which the change occurs. We find that the period identified by our model precisely coincides with events in the same period that correspond to the change that occurs.

Original languageEnglish
Title of host publicationCIKM 2011 Glasgow
Subtitle of host publicationDETECT'11 - Proceedings of the 2011 International Workshop on DETecting and Exploiting Cultural diversiTy on the Social Web
Pages35-40
Number of pages6
DOIs
Publication statusPublished - 2011
Externally publishedYes
EventInternational Workshop on DETecting and Exploiting Cultural diversiTy on the Social Web 2011 - Glasgow, United Kingdom
Duration: 24 Oct 201124 Oct 2011

Workshop

WorkshopInternational Workshop on DETecting and Exploiting Cultural diversiTy on the Social Web 2011
Abbreviated titleDETECT 2011
Country/TerritoryUnited Kingdom
CityGlasgow
Period24/10/1124/10/11

Keywords

  • event detection
  • semantic change
  • topic clustering
  • topic transition over time

Cite this