TY - JOUR
T1 - Machine Learning Techniques to Predict Timeliness of Care among Lung Cancer Patients
AU - Earnest, Arul
AU - Tesema, Getayeneh Antehunegn
AU - Stirling, Robert G.
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/10
Y1 - 2023/10
N2 - Delays in the assessment, management, and treatment of lung cancer patients may adversely impact prognosis and survival. This study is the first to use machine learning techniques to predict the quality and timeliness of care among lung cancer patients, utilising data from the Victorian Lung Cancer Registry (VLCR) between 2011 and 2022, in Victoria, Australia. Predictor variables included demographic, clinical, hospital, and geographical socio-economic indices. Machine learning methods such as random forests, k-nearest neighbour, neural networks, and support vector machines were implemented and evaluated using 20% out-of-sample cross validations via the area under the curve (AUC). Optimal model parameters were selected based on 10-fold cross validation. There were 11,602 patients included in the analysis. Evaluated quality indicators included, primarily, overall proportion achieving “time from referral date to diagnosis date ≤ 28 days” and proportion achieving “time from diagnosis date to first treatment date (any intent) ≤ 14 days”. Results showed that the support vector machine learning methods performed well, followed by nearest neighbour, based on out-of-sample AUCs of 0.89 (in-sample = 0.99) and 0.85 (in-sample = 0.99) for the first indicator, respectively. These models can be implemented in the registry databases to help healthcare workers identify patients who may not meet these indicators prospectively and enable timely interventions.d
AB - Delays in the assessment, management, and treatment of lung cancer patients may adversely impact prognosis and survival. This study is the first to use machine learning techniques to predict the quality and timeliness of care among lung cancer patients, utilising data from the Victorian Lung Cancer Registry (VLCR) between 2011 and 2022, in Victoria, Australia. Predictor variables included demographic, clinical, hospital, and geographical socio-economic indices. Machine learning methods such as random forests, k-nearest neighbour, neural networks, and support vector machines were implemented and evaluated using 20% out-of-sample cross validations via the area under the curve (AUC). Optimal model parameters were selected based on 10-fold cross validation. There were 11,602 patients included in the analysis. Evaluated quality indicators included, primarily, overall proportion achieving “time from referral date to diagnosis date ≤ 28 days” and proportion achieving “time from diagnosis date to first treatment date (any intent) ≤ 14 days”. Results showed that the support vector machine learning methods performed well, followed by nearest neighbour, based on out-of-sample AUCs of 0.89 (in-sample = 0.99) and 0.85 (in-sample = 0.99) for the first indicator, respectively. These models can be implemented in the registry databases to help healthcare workers identify patients who may not meet these indicators prospectively and enable timely interventions.d
KW - lung cancer
KW - machine learning
KW - socio-economic disadvantage
KW - timeliness of care
UR - http://www.scopus.com/inward/record.url?scp=85175374485&partnerID=8YFLogxK
U2 - 10.3390/healthcare11202756
DO - 10.3390/healthcare11202756
M3 - Article
C2 - 37893830
AN - SCOPUS:85175374485
SN - 2227-9032
VL - 11
JO - Healthcare
JF - Healthcare
IS - 20
M1 - 2756
ER -