Skip to main navigation Skip to search Skip to main content

Deep ensemble machine learning with Bayesian blending improved accuracy and precision of modelled ground-level ozone for region with sparse monitoring: Australia, 2005–2018

I. C. Hanigan, W. Yu, C. Yuen, K. Gopi, L. D. Knibbs, C. T. Cowie, B. Jalaludin, M. Cope, M. L. Riley, J. Heyworth, L. Morawska, G. B. Marks, G. G. Morgan, Yuming Guo (Leading Author)

Research output: Contribution to journalArticleResearchpeer-review

Abstract

Ground-level ozone (O3) is a significant public health concern. We developed maps of monthly average 1-h maximum O3 concentrations in New South Wales, Australia (2005–2018), a region with sparse monitoring. For the first time Bayesian Maximum Entropy (BME) blending was used within a Deep Ensemble Machine Learning (DEML) framework for air pollution predictions. The DEML combined geographical predictors in random forest (RF), extreme gradient boosting (XGBoost), and gradient boosted machine (GBM) models with three meta-models. BME blending incorporated observed O3 data into posterior predictions. We generated 2.5 km × 2.5 km resolution gridded surfaces. The DEML estimates achieved an R2 of 0.89 and RMSE of 2.3 ppb in the held-out test dataset at monitors. DEML grid cell predictions (R2: 0.84, RMSE: 3.03 ppb) were improved by BME blending (R2: 0.89, RMSE: 2.49 ppb). Mean bias reduced from −0.7 ppb to −0.4 ppb. This demonstrates high accuracy and precision in a sparsely monitored region.

Original languageEnglish
Article number106378
Number of pages12
JournalEnvironmental Modelling and Software
Volume187
DOIs
Publication statusPublished - Apr 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Air pollution epidemiology
  • Bayesian maximum entropy (BME)
  • Blending
  • Deep ensemble machine learning (DEML)
  • Exposure modelling
  • Tropospheric ozone

Cite this