Modelling the prevalence of diabetes mellitus risk factors based on artificial neural network and multiple regression

Kamal Gholipour, Mohammad Asghari-Jafarabadi, Shabnam Iezadi, Ali Jannati, Sina Keshavarz

Research output: Contribution to journalReview ArticleResearchpeer-review

20 Citations (Scopus)

Abstract

Background: Type 2 diabetes mellitus (T2DM) is a metabolic disease with complex causes, manifestations, complications and management. Understanding the wide range of risk factors for T2DM can facilitate diagnosis, proper classification and cost-effective management of the disease. Aims: To compare the power of an artificial neural network (ANN) and logistic regression in identifying T2DM risk factors. Methods: This descriptive and analytical study was conducted in 2013. The study samples were all residents aged 15–64 years of rural and urban areas in East Azerbaijan, Islamic Republic of Iran, who consented to participate (n = 990). The latest data available were collected from the Noncommunicable Disease Surveillance System of East Azerbaijan Province (2007). Data were analysed using SPSS version 19. Results: Based on multiple logistic regression, age, family history of T2DM and residence were the most important risk factors for T2DM. Based on ANN, age, body mass index and current smoking were most important. To test for generaliza-tion, ANN and logistic regression were evaluated using the area under the receiver operating characteristic curve (AUC). The AUC was 0.726 (SE = 0.025) and 0.717 (SE = 0.026) for logistic regression and ANN, respectively (P < 0.001). Conclusions: The logistic regression model is better than ANN and it is clinically more comprehensible.

Original languageEnglish
Pages (from-to)770-777
Number of pages8
JournalEastern Mediterranean Health Journal
Volume24
Issue number8
DOIs
Publication statusPublished - 2018
Externally publishedYes

Keywords

  • Artificial neural network
  • Diabetes mellitus
  • Multiple regression
  • Risk factors

Cite this