Experiments in query paraphrasing for information retrieval

Ingrid Zukerman, Bhavani Raskutti, Ying Ying Wen

    Research output: Chapter in Book/Report/Conference proceeding › Conference Paper › Research › peer-review

    4 Citations (Scopus)


    We investigate the effect of paraphrase generation on document retrieval performance. Specifically, we describe experiments where three information sources are used to generate lexical paraphrases of queries posed to the Internet. These information sources are: WordNet, a Webster-based thesaurus, and a combination of Webster and WordNet. Corpus-based information and word-similarity information are then used to rank the paraphrases. We evaluated our mechanism using 404 queries whose answers reside in the LA Times subset of the TREC-9 corpus. Our experiments show that query paraphrasing improves retrieval performance, and that performance is influenced both by the number of paraphrases generated for a query and by their quality. Specifically, the best performance was obtained using WordNet, which improves document recall by 14% and increases the number of questions that can be answered by 8%.
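
As a rough illustration of the idea described in the abstract (substitute query words with synonyms, then rank the resulting paraphrases), here is a minimal Python sketch. It is not the authors' mechanism: the `SYNONYMS` table, its similarity scores, the `paraphrases` function, and the product-of-scores ranking are all hypothetical stand-ins for a real lexical resource such as WordNet and for the corpus-based and word-similarity ranking the paper reports.

```python
from itertools import product

# Hypothetical toy synonym table: word -> [(synonym, similarity in (0, 1])].
# A real system would draw these from a resource such as WordNet or a thesaurus.
SYNONYMS = {
    "car": [("automobile", 0.9), ("vehicle", 0.6)],
    "buy": [("purchase", 0.8)],
}


def paraphrases(query, synonyms=SYNONYMS, top_k=5):
    """Return up to top_k (score, paraphrase) pairs, best first.

    Each word may be kept (score 1.0) or replaced by a listed synonym.
    A paraphrase is scored by the product of its per-word scores, which
    is an assumed ranking rule chosen only for illustration.
    """
    words = query.lower().split()
    # For every position, the candidates are the word itself plus its synonyms.
    options = [[(w, 1.0)] + synonyms.get(w, []) for w in words]

    original = " ".join(words)
    scored = []
    for combo in product(*options):
        text = " ".join(w for w, _ in combo)
        if text == original:
            continue  # the unchanged query is not a paraphrase
        score = 1.0
        for _, s in combo:
            score *= s
        scored.append((score, text))

    # Highest score first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return scored[:top_k]


if __name__ == "__main__":
    for score, text in paraphrases("buy car"):
        print(f"{score:.2f}  {text}")
```

The `top_k` parameter reflects the abstract's observation that both the number of paraphrases and their quality matter: raising it admits more, lower-scored paraphrases.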
    Original language: English
    Title of host publication: AI 2002: Advances in Artificial Intelligence
    Subtitle of host publication: 15th Australian Joint Conference on Artificial Intelligence, Canberra, Australia, December 2-6, 2002, Proceedings
    Editors: Bob McKay, John Slaney
    Place of Publication: Berlin, Germany
    Number of pages: 12
    ISBN (Print): 3540001972
    Publication status: Published - 2002
    Event: Australasian Joint Conference on Artificial Intelligence 2002 - Canberra, Australia
    Duration: 2 Dec 2002 → 6 Dec 2002
    Conference number: 15th
    https://link.springer.com/book/10.1007/3-540-36187-1 (Proceedings)

    Publication series

    Name: Lecture Notes in Computer Science
    ISSN (Print): 0302-9743


    Conference: Australasian Joint Conference on Artificial Intelligence 2002
    Abbreviated title: AI 2002
