TY - JOUR
T1 - Predicting volatile fatty acid synthesis from palm oil mill effluent on an industrial scale
AU - Shaw, Kar Ming
AU - Poh, Phaik Eong
AU - Ho, Yong Kuen
AU - Chan, Sow Keong
AU - Chew, Irene Mei Leng
N1 - Funding Information:
We thank Green Lagoon Technology Sdn Bhd for the research collaboration and supplying the plant operational data for this study.
Publisher Copyright:
© 2022 Elsevier B.V.
PY - 2022/11
Y1 - 2022/11
N2 - To determine the optimal conditions of anaerobic digestion of palm oil mill effluent (POME) for volatile fatty acid (VFA) synthesis on an industrial scale can be costly and time-consuming due to the large number of variables needed to be tested. In this study, variational autoencoder (VAE) was used to generate simulated data to expand the small size dataset. Then, machine learning models were developed to predict VFA synthesis. Using VAE, 250 simulated data were generated from 25 real data, and a total of 275 samples used for training/validation/testing of machine learning models. The dataset consists of features such as POME volume, COD loading, organic loading rate, temperature, pH. Random forest (RF) and artificial neural network (ANN) were trained and tested on predictions of VFA synthesis. Generally, ANN performed better than RF. To improve the performance, the 3 best performing ANN models were selected for ensemble neural network, where it has robust performances with RMSE of 55.80, R2 of 0.968 on test data. Features importance analysis indicated that temperature was the most important factor influencing VFA synthesis, suggesting that temperature control could improve VFA synthesis. Overall, ANN is robust to predict VFA synthesis while features importance analysis could provide the guidance for adjustment of optimal conditions, reducing the resources and workloads of experiments in real situations.
AB - To determine the optimal conditions of anaerobic digestion of palm oil mill effluent (POME) for volatile fatty acid (VFA) synthesis on an industrial scale can be costly and time-consuming due to the large number of variables needed to be tested. In this study, variational autoencoder (VAE) was used to generate simulated data to expand the small size dataset. Then, machine learning models were developed to predict VFA synthesis. Using VAE, 250 simulated data were generated from 25 real data, and a total of 275 samples used for training/validation/testing of machine learning models. The dataset consists of features such as POME volume, COD loading, organic loading rate, temperature, pH. Random forest (RF) and artificial neural network (ANN) were trained and tested on predictions of VFA synthesis. Generally, ANN performed better than RF. To improve the performance, the 3 best performing ANN models were selected for ensemble neural network, where it has robust performances with RMSE of 55.80, R2 of 0.968 on test data. Features importance analysis indicated that temperature was the most important factor influencing VFA synthesis, suggesting that temperature control could improve VFA synthesis. Overall, ANN is robust to predict VFA synthesis while features importance analysis could provide the guidance for adjustment of optimal conditions, reducing the resources and workloads of experiments in real situations.
KW - Anaerobic digestion
KW - Artificial neural networks
KW - Machine learning
KW - Waste treatment
UR - http://www.scopus.com/inward/record.url?scp=85140784859&partnerID=8YFLogxK
U2 - 10.1016/j.bej.2022.108671
DO - 10.1016/j.bej.2022.108671
M3 - Article
AN - SCOPUS:85140784859
SN - 1369-703X
VL - 187
JO - Biochemical Engineering Journal
JF - Biochemical Engineering Journal
M1 - 108671
ER -