Anomaly detection in streaming nonstationary temporal data

Priyanga Dilini Talagala, Rob J. Hyndman, Kate Smith-Miles, Sevvandi Kandanaarachchi, Mario A. Muñoz

Research output: Contribution to journalArticleResearchpeer-review

37 Citations (Scopus)

Abstract

This article proposes a framework that provides early detection of anomalous series within a large collection of nonstationary streaming time-series data. We define an anomaly as an observation, that is, very unlikely given the recent distribution of a given system. The proposed framework first calculates a boundary for the system’s typical behavior using extreme value theory. Then a sliding window is used to test for anomalous series within a newly arrived collection of series. The model uses time series features as inputs, and a density-based comparison to detect any significant changes in the distribution of the features. Using various synthetic and real world datasets, we demonstrate the wide applicability and usefulness of our proposed framework. We show that the proposed algorithm can work well in the presence of noisy nonstationarity data within multiple classes of time series. This framework is implemented in the open source R package oddstream. R code and data are available in the online supplementary materials.

Original languageEnglish
Pages (from-to)13-27
Number of pages15
JournalJournal of Computational and Graphical Statistics
Volume29
Issue number1
DOIs
Publication statusPublished - 2020

Keywords

  • Concept drift
  • Extreme value theory
  • Feature-based time series analysis
  • Kernel-based density estimation
  • Multivariate time series
  • Outlier detection

Cite this