
A relevance weighted ensemble model for anomaly detection in switching data streams

Mahsa Salehi, Christopher A. Leckie, Masud Moshtaghi, Tharshan Vaithianathan

    Research output: Chapter in Book/Report/Conference proceeding > Conference Paper > Research > peer-review

    Abstract

    Anomaly detection in data streams plays a vital role in online data mining applications. A major challenge for anomaly detection is the dynamically changing nature of many monitoring environments. This causes a problem for traditional anomaly detection techniques in data streams, which assume a relatively static monitoring environment. In an environment that is intermittently changing (known as switching data streams), static approaches can yield a high error rate in terms of false positives. To cope with dynamic environments, we require an approach that can learn from the history of normal behaviour in data streams, while accounting for the fact that not all time periods in the past are equally relevant. Consequently, we have proposed a relevance-weighted ensemble model for learning normal behaviour, which forms the basis of our anomaly detection scheme. The advantage of this approach is that it can improve the accuracy of detection by using relevant history, while remaining computationally efficient. Our solution provides a novel contribution through the use of ensemble techniques for anomaly detection in switching data streams. Our empirical results on real and synthetic data streams show that we can achieve substantial improvements compared to a recent anomaly detection algorithm for data streams.
    Original language: English
    Title of host publication: Advances in Knowledge Discovery and Data Mining: 18th Pacific-Asia Conference (PAKDD 2014)
    Subtitle of host publication: Tainan, Taiwan, May 13-16, 2014 Proceedings, Part II
    Editors: Vincent S. Tseng, Tu Bao Ho, Zhi-Hua Zhou, Arbee L. P. Chen, Hung-Yu Kao
    Place of Publication: Cham [Switzerland]
    Publisher: Springer
    Pages: 461 - 473
    Number of pages: 13
    ISBN (Electronic): 9783319066059
    ISBN (Print): 9783319066042
    Publication status: Published - 2014
    Event: Pacific-Asia Conference on Knowledge Discovery and Data Mining 2014 - Tainan, Taiwan
    Duration: 13 May 2014 - 16 May 2014
    Conference number: 18th
    https://sites.google.com/site/pakdd2014/
    https://link.springer.com/book/10.1007/978-3-319-06608-0 (Proceedings)

    Conference

    Conference: Pacific-Asia Conference on Knowledge Discovery and Data Mining 2014
    Abbreviated title: PAKDD 2014
    Country/Territory: Taiwan
    City: Tainan
    Period: 13/05/14 - 16/05/14

    Keywords

    • Anomaly detection
    • Ensemble models
    • Data streams
