Optimisation and calibration of Bayesian neural network for probabilistic prediction of biogas performance in an anaerobic lagoon

Benjamin Steven Vien, Thomas Kuen, Louis Raymond Francis Rose, Wing Kong Chiu

Research output: Contribution to journalArticleResearchpeer-review

1 Citation (Scopus)

Abstract

This study aims to enhance diagnostic capabilities for optimising the performance of the anaerobic sewage treatment lagoon at Melbourne Water’s Western Treatment Plant (WTP) through a novel machine learning (ML)-based monitoring strategy. This strategy employs ML to make accurate probabilistic predictions of biogas performance by leveraging diverse real-life operational and inspection sensor and other measurement data for asset management, decision making, and structural health monitoring (SHM). The paper commences with data analysis and preprocessing of complex irregular datasets to facilitate efficient learning in an artificial neural network. Subsequently, a Bayesian mixture density neural network model incorporating an attention-based mechanism in bidirectional long short-term memory (BiLSTM) was developed. This probabilistic approach uses a distribution output layer based on the Gaussian mixture model and Monte Carlo (MC) dropout technique in estimating data and model uncertainties, respectively. Furthermore, systematic hyperparameter optimisation revealed that the optimised model achieved a negative log-likelihood (NLL) of 0.074, significantly outperforming other configurations. It achieved an accuracy approximately 9 times greater than the average model performance (NLL = 0.753) and 22 times greater than the worst performing model (NLL = 1.677). Key factors influencing the model’s accuracy, such as the input window size and the number of hidden units in the BiLSTM layer, were identified, while the number of neurons in the fully connected layer was found to have no significant impact on accuracy. Moreover, model calibration using the expected calibration error was performed to correct the model’s predictive uncertainty. The findings suggest that the inherent data significantly contribute to the overall uncertainty of the model, highlighting the need for more high-quality data to enhance learning. This study lays the groundwork for applying ML in transforming high-value assets into intelligent structures and has broader implications for ML in asset management, SHM applications, and renewable energy sectors.

Original languageEnglish
Article number2537
Number of pages25
JournalSensors
Volume24
Issue number8
DOIs
Publication statusPublished - 15 Apr 2024

Keywords

  • anaerobic lagoon
  • Bayesian neural network
  • deep learning
  • hyperparameter optimisation
  • long short-term memory
  • machine learning
  • mixture density
  • model calibration
  • Monte Carlo dropout

Cite this