Finding web document associations using frequent pairs of adjacent words

Jason Yong Jin Tee, Lay Ki Soon, Bali Ranaivo-Malançon

Research output: Chapter in Book/Report/Conference proceeding; Conference Paper; Research; peer-review


This paper presents an approach to finding associations between Web documents using collocated word pairs. Given two Web documents that are connected via a hyperlink, we attempt to find the contextual association between the two pages by examining collocations of word pairs from a statistical point of view. Our preliminary experimental results show that the approach is able to extract fairly coherent word pairs from which associations between hyperlinked Web documents can be derived.
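The abstract does not spell out the statistical measure used, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that "frequent pairs of adjacent words" can be approximated by raw bigram counts, and that two hyperlinked pages are associated by the pairs that are frequent in both. The function names, the `min_count` threshold, and the sample texts are all hypothetical.

```python
import re
from collections import Counter


def adjacent_pairs(text):
    """Count adjacent word pairs (bigrams) in lowercased text.

    Simplification: punctuation is ignored, so pairs may span
    sentence boundaries.
    """
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(zip(words, words[1:]))


def shared_frequent_pairs(doc_a, doc_b, min_count=2):
    """Return pairs that occur at least min_count times in each document.

    Assumption: a raw frequency threshold stands in for whatever
    statistical collocation measure the paper actually uses.
    """
    pairs_a = adjacent_pairs(doc_a)
    pairs_b = adjacent_pairs(doc_b)
    return {
        pair: (pairs_a[pair], pairs_b[pair])
        for pair in pairs_a
        if pairs_a[pair] >= min_count and pairs_b[pair] >= min_count
    }


# Hypothetical text of a page and of a page it links to.
source_page = "Machine learning is fun. Machine learning needs data."
target_page = "We teach machine learning. Our machine learning course is new."

print(shared_frequent_pairs(source_page, target_page))
```

For these two sample texts, the only pair that meets the threshold in both pages is ("machine", "learning"), which occurs twice in each. A fuller treatment would likely replace the raw count with an association score and filter out stop words.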

Original language: English
Title of host publication: Knowledge Technology - Third Knowledge Technology Week, KTW 2011, Revised Selected Papers
Number of pages: 4
Publication status: Published - 2012
Externally published: Yes
Event: Knowledge Technology Week 2011 - Kajang, Malaysia
Duration: 18 Jul 2011 to 22 Jul 2011
Conference number: 3rd (SpringerLink - entire proceedings)

Publication series

Name: Communications in Computer and Information Science
Volume: 295 CCIS
ISSN (Print): 1865-0929


Conference: Knowledge Technology Week 2011
Abbreviated title: KTW 2011


  • collocation
  • document association
  • frequent word pairs
