Parallel processing of group by-before-join queries in cluster architecture

D. Taniar, J. W. Rahayu

    Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

    4 Citations (Scopus)

    Abstract

    SQL queries in the real world are replete with group-by and join operations. This type of queries is often known as GroupBy-Join queries. In some GroupBy-Join queries, it is desirable to perform group-by before join in order to achieve better performance. This subset of GroupBy-Join queries is called GroupBy-Before-Join queries. In this paper, we present a study on the parallelization of GroupBy-Before-Join queries, particularly by exploiting cluster architectures. From our study, we have learned that, in parallel query optimization, processing group-by operations as early as possible is not always desirable. On many occasions, performing data distribution first, before group-by, offers performance advantages. In this study, we also describe our cluster-based scheme.

    Original languageEnglish
    Title of host publicationProceedings - 1st IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2001
    PublisherIEEE, Institute of Electrical and Electronics Engineers
    Pages178-185
    Number of pages8
    ISBN (Print)0769510108, 9780769510101
    DOIs
    Publication statusPublished - 2001
    Event1st IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2001 - Brisbane, QLD, Australia
    Duration: 15 May 200118 May 2001

    Conference

    Conference1st IEEE/ACM International Symposium on Cluster Computing and the Grid, CCGrid 2001
    Country/TerritoryAustralia
    CityBrisbane, QLD
    Period15/05/0118/05/01

    Cite this