Autoregressive model-based fuzzy clustering and its application for detecting information redundancy in air pollution monitoring networks

Pierpaolo D'Urso, Dario Di Lallo, Elizabeth Ann Maharaj

Research output: Contribution to journalArticleResearchpeer-review

Abstract

Fuzzy clustering enables the simultaneous membership of objects in two or more clusters. This is particularly pertinent where time series are concerned, because very often patterns of time series change over time. Thus, a time series might belong to different clusters over different periods of time, in which case, crisp clustering is unable to capture this multi-cluster membership. In this paper, we adopt a Fuzzy C-Medoids approach to clustering time series based on autoregressive estimates of models fitted to the time series. We illustrate very good performance of this approach in a range of simulation studies. By means of two applications, we also show the usefulness of this clustering approach in the air pollution monitoring, by considering air pollution time series, i.e., CO time series, CO2 time series and NO time series monitored on world and urban scales. In particular, we show that, by considering in the clustering process, the autoregressive representation of these air pollution time series, we are able to detect possible information redundancy in the monitoring networks and then, decreasing the number of monitoring stations, to reduce the monitoring costs and then to increase the monitoring efficiency of the networks.
Original languageEnglish
Pages (from-to)83 - 131
Number of pages49
JournalSoft Computing
Volume17
Issue number1
DOIs
Publication statusPublished - 2013

Cite this

@article{4ef9bc28e47e4b32adafa6457377ecda,
title = "Autoregressive model-based fuzzy clustering and its application for detecting information redundancy in air pollution monitoring networks",
abstract = "Fuzzy clustering enables the simultaneous membership of objects in two or more clusters. This is particularly pertinent where time series are concerned, because very often patterns of time series change over time. Thus, a time series might belong to different clusters over different periods of time, in which case, crisp clustering is unable to capture this multi-cluster membership. In this paper, we adopt a Fuzzy C-Medoids approach to clustering time series based on autoregressive estimates of models fitted to the time series. We illustrate very good performance of this approach in a range of simulation studies. By means of two applications, we also show the usefulness of this clustering approach in the air pollution monitoring, by considering air pollution time series, i.e., CO time series, CO2 time series and NO time series monitored on world and urban scales. In particular, we show that, by considering in the clustering process, the autoregressive representation of these air pollution time series, we are able to detect possible information redundancy in the monitoring networks and then, decreasing the number of monitoring stations, to reduce the monitoring costs and then to increase the monitoring efficiency of the networks.",
author = "Pierpaolo D'Urso and {Di Lallo}, Dario and Maharaj, {Elizabeth Ann}",
year = "2013",
doi = "10.1007/s00500-012-0905-6",
language = "English",
volume = "17",
pages = "83 -- 131",
journal = "Soft Computing",
issn = "1432-7643",
publisher = "Springer-Verlag London Ltd.",
number = "1",

}

Autoregressive model-based fuzzy clustering and its application for detecting information redundancy in air pollution monitoring networks. / D'Urso, Pierpaolo; Di Lallo, Dario; Maharaj, Elizabeth Ann.

In: Soft Computing, Vol. 17, No. 1, 2013, p. 83 - 131.

Research output: Contribution to journalArticleResearchpeer-review

TY - JOUR

T1 - Autoregressive model-based fuzzy clustering and its application for detecting information redundancy in air pollution monitoring networks

AU - D'Urso, Pierpaolo

AU - Di Lallo, Dario

AU - Maharaj, Elizabeth Ann

PY - 2013

Y1 - 2013

N2 - Fuzzy clustering enables the simultaneous membership of objects in two or more clusters. This is particularly pertinent where time series are concerned, because very often patterns of time series change over time. Thus, a time series might belong to different clusters over different periods of time, in which case, crisp clustering is unable to capture this multi-cluster membership. In this paper, we adopt a Fuzzy C-Medoids approach to clustering time series based on autoregressive estimates of models fitted to the time series. We illustrate very good performance of this approach in a range of simulation studies. By means of two applications, we also show the usefulness of this clustering approach in the air pollution monitoring, by considering air pollution time series, i.e., CO time series, CO2 time series and NO time series monitored on world and urban scales. In particular, we show that, by considering in the clustering process, the autoregressive representation of these air pollution time series, we are able to detect possible information redundancy in the monitoring networks and then, decreasing the number of monitoring stations, to reduce the monitoring costs and then to increase the monitoring efficiency of the networks.

AB - Fuzzy clustering enables the simultaneous membership of objects in two or more clusters. This is particularly pertinent where time series are concerned, because very often patterns of time series change over time. Thus, a time series might belong to different clusters over different periods of time, in which case, crisp clustering is unable to capture this multi-cluster membership. In this paper, we adopt a Fuzzy C-Medoids approach to clustering time series based on autoregressive estimates of models fitted to the time series. We illustrate very good performance of this approach in a range of simulation studies. By means of two applications, we also show the usefulness of this clustering approach in the air pollution monitoring, by considering air pollution time series, i.e., CO time series, CO2 time series and NO time series monitored on world and urban scales. In particular, we show that, by considering in the clustering process, the autoregressive representation of these air pollution time series, we are able to detect possible information redundancy in the monitoring networks and then, decreasing the number of monitoring stations, to reduce the monitoring costs and then to increase the monitoring efficiency of the networks.

U2 - 10.1007/s00500-012-0905-6

DO - 10.1007/s00500-012-0905-6

M3 - Article

VL - 17

SP - 83

EP - 131

JO - Soft Computing

JF - Soft Computing

SN - 1432-7643

IS - 1

ER -