TY - JOUR
T1 - An ensemble interval prediction model with change point detection and interval perturbation-based adjustment strategy
T2 - A case study of air quality
AU - Jiang, Feng
AU - Zhu, Qiannan
AU - Tian, Tianhai
N1 - Funding Information:
This work was supported by the National Natural Science Foundation of China (Grant Nos. 61773401 and 11931019 ), the Humanities and Social Science Research Foundation of the Ministry of Education of China (Grant No. 22YJAZH038 ), the Hubei Province Key Laboratory of Systems Science in Metallurgical Process (Wuhan University of Science and Technology) (Grant No. Y202206 ), and the Hubei Key Laboratory of Big Data in Science and Technology (Grant No. KF2022005 ).
Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/7/15
Y1 - 2023/7/15
N2 - Point prediction has been used to predict air pollutant concentrations in recent years. However, it is still a challenge to characterize the time series data of pollutant concentrations in the presence of high volatility and uncertainty. Since interval prediction can quantity this uncertainty and provide more information than point prediction, we propose an improved interval prediction model based on lower upper bound estimation (LUBE) to construct the prediction intervals (PIs) of PM2.5 concentrations. First, we decompose the original time series into a trend term and a fluctuation term. Then, the trend term with regularity is analyzed by point prediction, while the fluctuation term with uncertainty is studied by LUBE interval prediction. To improve the efficiency and stabilize the randomness in the existing LUBE, we propose a new method based on change point detection and interval perturbation-based adjustment strategy (IPAS). IPAS is used to replace the optimization algorithm in LUBE for improving efficiency. Meanwhile, change point detection is introduced to optimize the initialized parameters in LUBE. In addition, since PM2.5 concentration is influenced by various factors, partial autocorrelation function and maximal information coefficient are applied to select the optimal input features from meteorological and other air pollution factors associated with PM2.5 concentrations. Moreover, we ensemble the prediction results of the trend term and fluctuation term to obtain the ultimate PIs. To evaluate the effectiveness and efficiency of our proposed model, the daily PM2.5 concentration data in Wuhan, China, are analyzed in the empirical study. Comparison results and ablation study clearly indicate that our proposed model has better comprehensive performance than classical models and benchmark models. The proposed new method can provide high-quality PIs and achieve better stability.
AB - Point prediction has been used to predict air pollutant concentrations in recent years. However, it is still a challenge to characterize the time series data of pollutant concentrations in the presence of high volatility and uncertainty. Since interval prediction can quantity this uncertainty and provide more information than point prediction, we propose an improved interval prediction model based on lower upper bound estimation (LUBE) to construct the prediction intervals (PIs) of PM2.5 concentrations. First, we decompose the original time series into a trend term and a fluctuation term. Then, the trend term with regularity is analyzed by point prediction, while the fluctuation term with uncertainty is studied by LUBE interval prediction. To improve the efficiency and stabilize the randomness in the existing LUBE, we propose a new method based on change point detection and interval perturbation-based adjustment strategy (IPAS). IPAS is used to replace the optimization algorithm in LUBE for improving efficiency. Meanwhile, change point detection is introduced to optimize the initialized parameters in LUBE. In addition, since PM2.5 concentration is influenced by various factors, partial autocorrelation function and maximal information coefficient are applied to select the optimal input features from meteorological and other air pollution factors associated with PM2.5 concentrations. Moreover, we ensemble the prediction results of the trend term and fluctuation term to obtain the ultimate PIs. To evaluate the effectiveness and efficiency of our proposed model, the daily PM2.5 concentration data in Wuhan, China, are analyzed in the empirical study. Comparison results and ablation study clearly indicate that our proposed model has better comprehensive performance than classical models and benchmark models. The proposed new method can provide high-quality PIs and achieve better stability.
KW - Change point detection
KW - Interval perturbation-based adjustment strategy
KW - Interval prediction
KW - Lower upper bound estimation
KW - PM2.5 concentration
UR - http://www.scopus.com/inward/record.url?scp=85150047679&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2023.119823
DO - 10.1016/j.eswa.2023.119823
M3 - Article
AN - SCOPUS:85150047679
SN - 0957-4174
VL - 222
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 119823
ER -