TY - JOUR
T1 - Optimization of breast mass classification using sequential forward floating selection (SFFS) and a support vector machine (SVM) model
AU - Tan, Maxine
AU - Pu, Jiantao
AU - Zheng, Bin
N1 - Funding Information:
Acknowledgments This work was supported in part by the National Cancer Institute, National Institutes of Health, under Grant R01CA 160205.
Publisher Copyright:
© 2014, CARS.
Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2014/11
Y1 - 2014/11
N2 - Purpose: Improving radiologists’ performance in classification between malignant and benign breast lesions is important to increase cancer detection sensitivity and reduce false-positive recalls. For this purpose, developing computer-aided diagnosis schemes has been attracting research interest in recent years. In this study, we investigated a new feature selection method for the task of breast mass classification.Methods: We initially computed 181 image features based on mass shape, spiculation, contrast, presence of fat or calcifications, texture, isodensity, and other morphological features. From this large image feature pool, we used a sequential forward floating selection (SFFS)-based feature selection method to select relevant features and analyzed their performance using a support vector machine (SVM) model trained for the classification task. On a database of 600 benign and 600 malignant mass regions of interest, we performed the study using a tenfold cross-validation method. Feature selection and optimization of the SVM parameters were conducted on the training subsets only.Conclusion: In conclusion, this comprehensive feature analysis study provided new and valuable information for optimizing computerized mass classification schemes that may have potential to be useful as a “second reader” in future clinical practice.Results: The area under the receiver operating characteristic curve (Formula presented.) was obtained for the classification task. The results also showed that the most frequently selected features by the SFFS-based algorithm in tenfold iterations were those related to mass shape, isodensity, and presence of fat, which are consistent with the image features frequently used by radiologists in the clinical environment for mass classification. The study also indicated that accurately computing mass spiculation features from the projection mammograms was difficult, and failed to perform well for the mass classification task due to tissue overlap within the benign mass regions.
AB - Purpose: Improving radiologists’ performance in classification between malignant and benign breast lesions is important to increase cancer detection sensitivity and reduce false-positive recalls. For this purpose, developing computer-aided diagnosis schemes has been attracting research interest in recent years. In this study, we investigated a new feature selection method for the task of breast mass classification.Methods: We initially computed 181 image features based on mass shape, spiculation, contrast, presence of fat or calcifications, texture, isodensity, and other morphological features. From this large image feature pool, we used a sequential forward floating selection (SFFS)-based feature selection method to select relevant features and analyzed their performance using a support vector machine (SVM) model trained for the classification task. On a database of 600 benign and 600 malignant mass regions of interest, we performed the study using a tenfold cross-validation method. Feature selection and optimization of the SVM parameters were conducted on the training subsets only.Conclusion: In conclusion, this comprehensive feature analysis study provided new and valuable information for optimizing computerized mass classification schemes that may have potential to be useful as a “second reader” in future clinical practice.Results: The area under the receiver operating characteristic curve (Formula presented.) was obtained for the classification task. The results also showed that the most frequently selected features by the SFFS-based algorithm in tenfold iterations were those related to mass shape, isodensity, and presence of fat, which are consistent with the image features frequently used by radiologists in the clinical environment for mass classification. The study also indicated that accurately computing mass spiculation features from the projection mammograms was difficult, and failed to perform well for the mass classification task due to tissue overlap within the benign mass regions.
KW - Breast cancer
KW - Computer-aided diagnosis of mammograms
KW - Feature selection
KW - Pattern classification
UR - http://www.scopus.com/inward/record.url?scp=84896440084&partnerID=8YFLogxK
U2 - 10.1007/s11548-014-0992-1
DO - 10.1007/s11548-014-0992-1
M3 - Article
C2 - 24664267
AN - SCOPUS:84896440084
VL - 9
SP - 1005
EP - 1020
JO - International Journal of Computer Assisted Radiology and Surgery
JF - International Journal of Computer Assisted Radiology and Surgery
SN - 1861-6410
IS - 6
ER -