TY - JOUR
T1 - A novel hybrid machine learning framework for the prediction of diabetes with context-customized regularization and prediction procedures
AU - Rajagopal, Aghila
AU - Jha, Sudan
AU - Alagarsamy, Ramachandran
AU - Quek, Shio Gai
AU - Selvachandran, Ganeshsree
N1 - Funding Information:
This research was funded by the Ministry of Education, Malaysia under grant no. FRGS/1/2020/STG06/UCSI/02/1 .
Publisher Copyright:
© 2022 International Association for Mathematics and Computers in Simulation (IMACS)
PY - 2022/8
Y1 - 2022/8
N2 - This paper proposes a customized hybrid model of artificial neural network (ANN) and genetic algorithms for an efficient diabetes disease prediction framework. Our customized hybrid model uses an improvised technique of detecting the more visible patterns of relations between the variables. Initially, the input medical dataset is preprocessed using a novel normalization technique that works consistently for all degrees of skewness of data. Then, our proposed decision-making algorithm will correctly identify the degree of importance of each variable in influencing the output, and thus priority will be given to the variables that are deemed most important. This is then followed by the implementation of a regularization method that is custom-made for the prediction of diabetes. Such a customized regularization method is considered asymmetrical because the positive numbers are more favored compared to negative numbers, and this was decided based on the characteristics of the dataset. The proposed technique deals with missing numbers as a separate kind of entity compared to numerical entries and can adapt itself to a given dataset. The proposed customized hybrid model and its accompanying decision-making algorithm were applied to the Pima Indian Diabetes dataset sourced from the UCI Machine Learning Repository with an 80% prediction accuracy.
AB - This paper proposes a customized hybrid model of artificial neural network (ANN) and genetic algorithms for an efficient diabetes disease prediction framework. Our customized hybrid model uses an improvised technique of detecting the more visible patterns of relations between the variables. Initially, the input medical dataset is preprocessed using a novel normalization technique that works consistently for all degrees of skewness of data. Then, our proposed decision-making algorithm will correctly identify the degree of importance of each variable in influencing the output, and thus priority will be given to the variables that are deemed most important. This is then followed by the implementation of a regularization method that is custom-made for the prediction of diabetes. Such a customized regularization method is considered asymmetrical because the positive numbers are more favored compared to negative numbers, and this was decided based on the characteristics of the dataset. The proposed technique deals with missing numbers as a separate kind of entity compared to numerical entries and can adapt itself to a given dataset. The proposed customized hybrid model and its accompanying decision-making algorithm were applied to the Pima Indian Diabetes dataset sourced from the UCI Machine Learning Repository with an 80% prediction accuracy.
KW - Artificial neural network
KW - Asymmetrical regularization
KW - Diabetes prediction
KW - Disease prediction
KW - Genetic algorithm
UR - http://www.scopus.com/inward/record.url?scp=85127063499&partnerID=8YFLogxK
U2 - 10.1016/j.matcom.2022.03.003
DO - 10.1016/j.matcom.2022.03.003
M3 - Article
AN - SCOPUS:85127063499
SN - 0378-4754
VL - 198
SP - 388
EP - 406
JO - Mathematics and Computers in Simulation
JF - Mathematics and Computers in Simulation
ER -