Multi-faceted information retrieval system for large scale email archives

Jukka Perkiö, Ville Tuulos, Wray Buntine, Henry Tirri

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

3 Citations (Scopus)

Abstract

We profile a system for search and analysis of large-scale email archives. The system builds around four facets: Content-based search engine, statistical topic model, automatically inferred social networks and time-series analysis. The facets correspond to the types of information available in email data. The presented system allows chaining or combining the facets flexibly. Results of one facet may be used as input to another, yielding remarkable combinatorial power. In information retrieval point of view, the system provides support for exploration, approximate textual searches and data visualization. We present some experimental results based on a large real-world email corpus.

Original languageEnglish
Title of host publicationProceedings - 2005 IEEE/WIC/ACM InternationalConference on Web Intelligence, WI 2005
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages557-564
Number of pages8
Volume2005
ISBN (Print)076952415X, 9780769524153
DOIs
Publication statusPublished - 1 Dec 2005
EventIEEE/WIC/ACM international Conference on Web Intelligence 2005 - Compiegne Cedex, France
Duration: 19 Sept 200522 Sept 2005

Conference

ConferenceIEEE/WIC/ACM international Conference on Web Intelligence 2005
Abbreviated titleWI 2005
Country/TerritoryFrance
CityCompiegne Cedex,
Period19/09/0522/09/05

Cite this