Abstract
We profile a system for search and analysis of large-scale email archives. The system builds around four facets: Content-based search engine, statistical topic model, automatically inferred social networks and time-series analysis. The facets correspond to the types of information available in email data. The presented system allows chaining or combining the facets flexibly. Results of one facet may be used as input to another, yielding remarkable combinatorial power. In information retrieval point of view, the system provides support for exploration, approximate textual searches and data visualization. We present some experimental results based on a large real-world email corpus.
Original language | English |
---|---|
Title of host publication | Proceedings - 2005 IEEE/WIC/ACM InternationalConference on Web Intelligence, WI 2005 |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 557-564 |
Number of pages | 8 |
Volume | 2005 |
ISBN (Print) | 076952415X, 9780769524153 |
DOIs | |
Publication status | Published - 1 Dec 2005 |
Event | IEEE/WIC/ACM international Conference on Web Intelligence 2005 - Compiegne Cedex, France Duration: 19 Sept 2005 → 22 Sept 2005 |
Conference
Conference | IEEE/WIC/ACM international Conference on Web Intelligence 2005 |
---|---|
Abbreviated title | WI 2005 |
Country/Territory | France |
City | Compiegne Cedex, |
Period | 19/09/05 → 22/09/05 |