kNNVWC: An efficient k-nearest neighbors approach based on various-widths clustering

Abdul Mohsen Almalawi, Adil Fahad, Zahir Tari, Muhammad Aamir Cheema, Ibrahim Khalil

    Research output: Contribution to journalArticleResearchpeer-review

    44 Citations (Scopus)

    Abstract

    The k-nearest neighbor approach (kNN) has been extensively used as a powerful non-parametric technique in many scientific and engineering applications. However, this approach incurs a large computational cost. Hence, this issue has become an active research field. In this work, a novel kNN approach based on various-widths clustering, named kNNVWC, to efficiently find kNNs for a query object from a given data set, is presented. kNNVWC does clustering using various widths, where a data set is clustered with a global width first and each produced cluster that meets the predefined criteria is recursively clustered with its own local width that suits its distribution. This reduces the clustering time, in addition to balancing the number of produced clusters and their respective sizes. Maximum efficiency is achieved by using triangle inequality to prune unlikely clusters. Experimental results demonstrate that kNNVWC performs well in finding kNNs for query objects compared to a number of kNN search algorithms, especially for a data set with high dimensions, various distributions and large size.
    Original languageEnglish
    Article number7166319
    Pages (from-to)68 - 81
    Number of pages14
    JournalIEEE Transactions on Knowledge and Data Engineering
    Volume28
    Issue number1
    DOIs
    Publication statusPublished - 2016

    Keywords

    • Clustering
    • K-nearest neighbour
    • High dimensionality
    • Performance
    • SCADA

    Cite this