TY - JOUR
T1 - Forecasting Scrub Typhus Cases in Eight High-Risk Counties in China
T2 - Evaluation of Time-Series Model Performance
AU - He, Junyu
AU - Wei, Xianyu
AU - Yin, Wenwu
AU - Wang, Yong
AU - Qian, Quan
AU - Sun, Hailong
AU - Xu, Yuanyong
AU - Soares Magalhaes, Ricardo J.
AU - Guo, Yuming
AU - Zhang, Wenyi
N1 - Funding Information:
This work is partly supported by the China Postdoctoral Science Foundation (2020M681825) and the projects from the 13th Five-Year Plan (Nos.18QNP063 and 17SAZ01).
Publisher Copyright:
Copyright © 2022 He, Wei, Yin, Wang, Qian, Sun, Xu, Soares Magalhaes, Guo and Zhang.
PY - 2022/1/12
Y1 - 2022/1/12
N2 - Scrub typhus (ST) is expanding its geographical distribution in China and in many regions worldwide raising significant public health concerns. Accurate ST time-series modeling including uncovering the role of environmental determinants is of great importance to guide disease control purposes. This study evaluated the performance of three competing time-series modeling approaches at forecasting ST cases during 2012–2020 in eight high-risk counties in China. We evaluated the performance of a seasonal autoregressive-integrated moving average (SARIMA) model, a SARIMA model with exogenous variables (SARIMAX), and the long–short term memory (LSTM) model to depict temporal variations in ST cases. In our investigation, we considered eight environmental variables known to be associated with ST landscape epidemiology, including the normalized difference vegetation index (NDVI), temperature, precipitation, atmospheric pressure, sunshine duration, relative humidity, wind speed, and multivariate El Niño/Southern Oscillation index (MEI). The first 8-year data and the last year data were used to fit the models and forecast ST cases, respectively. Our results showed that the inclusion of exogenous variables in the SARIMAX model generally outperformed the SARIMA model. Our results also indicate that the role of exogenous variables with various temporal lags varies between counties, suggesting that ST cases are temporally non-stationary. In conclusion, our study demonstrates that the approach to forecast ST cases needed to take into consideration local conditions in that time-series model performance differed between high-risk areas under investigation. Furthermore, the introduction of time-series models, especially LSTM, has enriched the ability of local public health authorities in ST high-risk areas to anticipate and respond to ST outbreaks, such as setting up an early warning system and forecasting ST precisely.
AB - Scrub typhus (ST) is expanding its geographical distribution in China and in many regions worldwide raising significant public health concerns. Accurate ST time-series modeling including uncovering the role of environmental determinants is of great importance to guide disease control purposes. This study evaluated the performance of three competing time-series modeling approaches at forecasting ST cases during 2012–2020 in eight high-risk counties in China. We evaluated the performance of a seasonal autoregressive-integrated moving average (SARIMA) model, a SARIMA model with exogenous variables (SARIMAX), and the long–short term memory (LSTM) model to depict temporal variations in ST cases. In our investigation, we considered eight environmental variables known to be associated with ST landscape epidemiology, including the normalized difference vegetation index (NDVI), temperature, precipitation, atmospheric pressure, sunshine duration, relative humidity, wind speed, and multivariate El Niño/Southern Oscillation index (MEI). The first 8-year data and the last year data were used to fit the models and forecast ST cases, respectively. Our results showed that the inclusion of exogenous variables in the SARIMAX model generally outperformed the SARIMA model. Our results also indicate that the role of exogenous variables with various temporal lags varies between counties, suggesting that ST cases are temporally non-stationary. In conclusion, our study demonstrates that the approach to forecast ST cases needed to take into consideration local conditions in that time-series model performance differed between high-risk areas under investigation. Furthermore, the introduction of time-series models, especially LSTM, has enriched the ability of local public health authorities in ST high-risk areas to anticipate and respond to ST outbreaks, such as setting up an early warning system and forecasting ST precisely.
KW - China
KW - environmental factors
KW - LSTM model
KW - SARIMA model
KW - SARIMAX model
KW - scrub typhus
KW - time-series modeling
UR - http://www.scopus.com/inward/record.url?scp=85123401074&partnerID=8YFLogxK
U2 - 10.3389/fenvs.2021.783864
DO - 10.3389/fenvs.2021.783864
M3 - Article
AN - SCOPUS:85123401074
SN - 2296-665X
VL - 9
JO - Frontiers in Environmental Science
JF - Frontiers in Environmental Science
M1 - 783864
ER -