TY - JOUR
T1 - Modelling the prevalence of diabetes mellitus risk factors based on artificial neural network and multiple regression
AU - Gholipour, Kamal
AU - Asghari-Jafarabadi, Mohammad
AU - Iezadi, Shabnam
AU - Jannati, Ali
AU - Keshavarz, Sina
N1 - Publisher Copyright:
© World Health Organization (WHO) 2018. Some rights reserved.
PY - 2018
Y1 - 2018
N2 - Background: Type 2 diabetes mellitus (T2DM) is a metabolic disease with complex causes, manifestations, complications and management. Understanding the wide range of risk factors for T2DM can facilitate diagnosis, proper classification and cost-effective management of the disease. Aims: To compare the power of an artificial neural network (ANN) and logistic regression in identifying T2DM risk factors. Methods: This descriptive and analytical study was conducted in 2013. The study samples were all residents aged 15–64 years of rural and urban areas in East Azerbaijan, Islamic Republic of Iran, who consented to participate (n = 990). The latest data available were collected from the Noncommunicable Disease Surveillance System of East Azerbaijan Province (2007). Data were analysed using SPSS version 19. Results: Based on multiple logistic regression, age, family history of T2DM and residence were the most important risk factors for T2DM. Based on ANN, age, body mass index and current smoking were most important. To test for generaliza-tion, ANN and logistic regression were evaluated using the area under the receiver operating characteristic curve (AUC). The AUC was 0.726 (SE = 0.025) and 0.717 (SE = 0.026) for logistic regression and ANN, respectively (P < 0.001). Conclusions: The logistic regression model is better than ANN and it is clinically more comprehensible.
AB - Background: Type 2 diabetes mellitus (T2DM) is a metabolic disease with complex causes, manifestations, complications and management. Understanding the wide range of risk factors for T2DM can facilitate diagnosis, proper classification and cost-effective management of the disease. Aims: To compare the power of an artificial neural network (ANN) and logistic regression in identifying T2DM risk factors. Methods: This descriptive and analytical study was conducted in 2013. The study samples were all residents aged 15–64 years of rural and urban areas in East Azerbaijan, Islamic Republic of Iran, who consented to participate (n = 990). The latest data available were collected from the Noncommunicable Disease Surveillance System of East Azerbaijan Province (2007). Data were analysed using SPSS version 19. Results: Based on multiple logistic regression, age, family history of T2DM and residence were the most important risk factors for T2DM. Based on ANN, age, body mass index and current smoking were most important. To test for generaliza-tion, ANN and logistic regression were evaluated using the area under the receiver operating characteristic curve (AUC). The AUC was 0.726 (SE = 0.025) and 0.717 (SE = 0.026) for logistic regression and ANN, respectively (P < 0.001). Conclusions: The logistic regression model is better than ANN and it is clinically more comprehensible.
KW - Artificial neural network
KW - Diabetes mellitus
KW - Multiple regression
KW - Risk factors
UR - http://www.scopus.com/inward/record.url?scp=85055080856&partnerID=8YFLogxK
U2 - 10.26719/EMHJ.18.012
DO - 10.26719/EMHJ.18.012
M3 - Review Article
C2 - 30328607
AN - SCOPUS:85055080856
SN - 1020-3397
VL - 24
SP - 770
EP - 777
JO - Eastern Mediterranean Health Journal
JF - Eastern Mediterranean Health Journal
IS - 8
ER -