TY - JOUR
T1 - Machine Learning Algorithms Versus Classical Regression Models in Pre-Eclampsia Prediction
T2 - A Systematic Review
AU - Tiruneh, Sofonyas Abebaw
AU - Vu, Tra Thuan Thanh
AU - Rolnik, Daniel Lorber
AU - Teede, Helena J.
A2 - Enticott, Joanne
N1 - Funding Information:
Open Access funding enabled and organized by CAUL and its Member Institutions No specific funding was received for this review. SAT received PhD scholarship support from the Monash Graduate Scholarship (MGS) and the Monash International Tuition Scholarship (MITS). HT received NHMRC fellowship support (2009326) and Medical Research Future Fund support from the Australian Government.
Publisher Copyright:
© The Author(s) 2024.
PY - 2024/7
Y1 - 2024/7
N2 - Purpose of Review: Machine learning (ML) approaches are an emerging alternative for healthcare risk prediction. We aimed to synthesise the literature on ML and classical regression studies exploring potential prognostic factors and to compare prediction performance for pre-eclampsia. Recent Findings: From 9382 studies retrieved, 82 were included. Sixty-six publications exclusively reported eighty-four classical regression models to predict variable timing of onset of pre-eclampsia. Another six publications reported purely ML algorithms, whilst another 10 publications reported ML algorithms and classical regression models in the same sample with 8 of 10 findings that ML algorithms outperformed classical regression models. The most frequent prognostic factors were age, pre-pregnancy body mass index, chronic medical conditions, parity, prior history of pre-eclampsia, mean arterial pressure, uterine artery pulsatility index, placental growth factor, and pregnancy-associated plasma protein A. Top performing ML algorithms were random forest (area under the curve (AUC) = 0.94, 95% confidence interval (CI) 0.91–0.96) and extreme gradient boosting (AUC = 0.92, 95% CI 0.90–0.94). The competing risk model had similar performance (AUC = 0.92, 95% CI 0.91–0.92) compared with a neural network. Calibration performance was not reported in the majority of publications. Summary: ML algorithms had better performance compared to classical regression models in pre-eclampsia prediction. Random forest and boosting-type algorithms had the best prediction performance. Further research should focus on comparing ML algorithms to classical regression models using the same samples and evaluation metrics to gain insight into their performance. External validation of ML algorithms is warranted to gain insights into their generalisability.
AB - Purpose of Review: Machine learning (ML) approaches are an emerging alternative for healthcare risk prediction. We aimed to synthesise the literature on ML and classical regression studies exploring potential prognostic factors and to compare prediction performance for pre-eclampsia. Recent Findings: From 9382 studies retrieved, 82 were included. Sixty-six publications exclusively reported eighty-four classical regression models to predict variable timing of onset of pre-eclampsia. Another six publications reported purely ML algorithms, whilst another 10 publications reported ML algorithms and classical regression models in the same sample with 8 of 10 findings that ML algorithms outperformed classical regression models. The most frequent prognostic factors were age, pre-pregnancy body mass index, chronic medical conditions, parity, prior history of pre-eclampsia, mean arterial pressure, uterine artery pulsatility index, placental growth factor, and pregnancy-associated plasma protein A. Top performing ML algorithms were random forest (area under the curve (AUC) = 0.94, 95% confidence interval (CI) 0.91–0.96) and extreme gradient boosting (AUC = 0.92, 95% CI 0.90–0.94). The competing risk model had similar performance (AUC = 0.92, 95% CI 0.91–0.92) compared with a neural network. Calibration performance was not reported in the majority of publications. Summary: ML algorithms had better performance compared to classical regression models in pre-eclampsia prediction. Random forest and boosting-type algorithms had the best prediction performance. Further research should focus on comparing ML algorithms to classical regression models using the same samples and evaluation metrics to gain insight into their performance. External validation of ML algorithms is warranted to gain insights into their generalisability.
KW - Calibration
KW - Classical regression
KW - Discrimination
KW - Machine learning
KW - Pre-eclampsia
KW - Prediction models
UR - http://www.scopus.com/inward/record.url?scp=85194899292&partnerID=8YFLogxK
U2 - 10.1007/s11906-024-01297-1
DO - 10.1007/s11906-024-01297-1
M3 - Article
C2 - 38806766
AN - SCOPUS:85194899292
SN - 1522-6417
VL - 26
SP - 309
EP - 323
JO - Current Hypertension Reports
JF - Current Hypertension Reports
IS - 7
ER -