From Tf-Idf to learning-to-rank: An overview

Muhammad Ibrahim, Manzur Murshed

    Research output: Chapter in Book/Report/Conference proceedingChapter (Book)Researchpeer-review

    4 Citations (Scopus)

    Abstract

    Ranking a set of documents based on their relevances with respect to a given query is a central problem of information retrieval (IR). Traditionally people have been using unsupervised scoring methods like tf-idf, BM25, Language Model etc., but recently supervised machine learning framework is being used successfully to learn a ranking function, which is called learning-to-rank (LtR) problem. There are a few surveys on LtR in the literature; but these reviews provide very little assistance to someone who, before delving into technical details of different algorithms, wants to have a broad understanding of LtR systems and its evolution from and relation to the traditional IR methods. This chapter tries to address this gap in the literature. Mainly the following aspects are discussed: the fundamental concepts of IR, the motivation behind LtR, the evolution of LtR from and its relation to the traditional methods, the relationship between LtR and other supervised machine learning tasks, the general issues pertaining to an LtR algorithm, and the theory of LtR.

    Original languageEnglish
    Title of host publicationHandbook of Research on Innovations in Information Retrieval, Analysis, and Management
    EditorsJorge Tiago Martins, Andreea Molnar
    Place of PublicationHershey PA, USA
    PublisherIGI Global
    Pages62-109
    Number of pages48
    ISBN (Electronic)9781466688346
    ISBN (Print)9781466688339
    DOIs
    Publication statusPublished - 2016

    Cite this