A scalable topic-based open source search engine

Wray Buntine, Jaakko Löfström, Jukka Perkiö, Sami Perttu, Vladimir Poroshin, Tomi Silander, Henry Tirri, Antti Tuominen, Ville Tuulos

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

25 Citations (Scopus)


Site-based or topic-specific search engines work with mixed success because of the general difficulty of the information retrieval task, and the lack of good link information to allow authorities to be identified. We are advocating an open source approach to the problem due to its scope and need for software components. We have adopted a topicbased search engine because it represents the next generation of capability. This paper outlines our scalable system for site-based or topic-specific search, and demonstrates the developing system on a small 250,000 document collection of EU and UN web pages.

Original languageEnglish
Title of host publicationProceedings - IEEE/WIC/ACM International Conference on Web Intelligence, WI 2004
EditorsN. Zhong, H. Tirri, Y. Yao, L. Zhou
Number of pages7
Publication statusPublished - 1 Dec 2004
EventIEEE/WIC/ACM international Conference on Web Intelligence 2004 - Beijing, China
Duration: 20 Sept 200424 Sept 2004


ConferenceIEEE/WIC/ACM international Conference on Web Intelligence 2004
Abbreviated titleWI 2004

Cite this