TY - CHAP
T1 - Explainable AI for colorectal cancer classification
AU - Mulenga, Mwenge
AU - Seera, Manjeevan
AU - Kareem, Sameem Abdul
AU - Sabri, Aznul Qalid Md
PY - 2024
Y1 - 2024
N2 - Colorectal cancer (CRC) ranks second highest in global mortality among nonsex-related cancers. Conventional machine learning (ML) algorithms applied to microbiome-based CRC detection often yield suboptimal accuracy. Conversely, deep neural network (DNN)-based methods encounter limitations due to scarce labeled samples, data imbalance, and dominant features. The lack of interpretability in artificial intelligence models further hinders their adoption in healthcare. This chapter proposes an explainable DNN model for improved CRC detection utilizing stool-based microbiome data. The model employs a square root-based normalization method and a feature extension approach, incorporating customized normalization techniques to enhance prediction performance. These methods effectively address outliers, dominant features, and dimensionality challenges. The square root-based method mitigates the effect of outliers and feature dominance, while the feature extension technique expands the dataset’s feature space, potentially improving feature relevance across samples. Leveraging automatic feature selection by the DNN algorithm, the model performs classification using a subset of available features. Evaluation on publicly available datasets demonstrates the efficacy of the proposed methods, with the square root-based method achieving area under the curve scores of 91.3% and 75.8% on datasets 1 and 2, respectively. The feature extension-based method achieves AUC scores of 90.2% and 74% on the respective datasets.
AB - Colorectal cancer (CRC) ranks second highest in global mortality among nonsex-related cancers. Conventional machine learning (ML) algorithms applied to microbiome-based CRC detection often yield suboptimal accuracy. Conversely, deep neural network (DNN)-based methods encounter limitations due to scarce labeled samples, data imbalance, and dominant features. The lack of interpretability in artificial intelligence models further hinders their adoption in healthcare. This chapter proposes an explainable DNN model for improved CRC detection utilizing stool-based microbiome data. The model employs a square root-based normalization method and a feature extension approach, incorporating customized normalization techniques to enhance prediction performance. These methods effectively address outliers, dominant features, and dimensionality challenges. The square root-based method mitigates the effect of outliers and feature dominance, while the feature extension technique expands the dataset’s feature space, potentially improving feature relevance across samples. Leveraging automatic feature selection by the DNN algorithm, the model performs classification using a subset of available features. Evaluation on publicly available datasets demonstrates the efficacy of the proposed methods, with the square root-based method achieving area under the curve scores of 91.3% and 75.8% on datasets 1 and 2, respectively. The feature extension-based method achieves AUC scores of 90.2% and 74% on the respective datasets.
KW - Colorectal cancer
KW - Microbiome data
KW - Deep neural networks
KW - Normalization techniques
KW - Feature dominance
U2 - 10.1007/978-981-97-3705-5_10
DO - 10.1007/978-981-97-3705-5_10
M3 - Chapter (Book)
SN - 9789819737048
T3 - Computational Intelligence Methods and Applications
SP - 203
EP - 223
BT - Explainable AI in Health Informatics
A2 - Aluvalu, Rajanikanth
A2 - Mehta, Mayuri
A2 - Siarry, Patrick
PB - Springer
CY - Singapore Singapore
ER -