TY - JOUR
T1 - An empirical study based on semi-supervised hybrid self-organizing map for software fault prediction
AU - Abaei, Golnoush
AU - Selamat, Ali
AU - Fujita, Hamido
N1 - Funding Information:
The authors would like to thank Zahra Rezaei dan M. Reza Mashinchi from Software Engineering Research Group, Universiti Teknologi Malaysia (UTM) for providing some comments in improving the manuscript. Moreover, this work is supported by the Research Management Centre (RMC) at the Universiti Teknologi Malaysia under Research University Grant ( Vot 01G72 ) and the Ministry of Science, Technology & Innovations Malaysia under Science Fund ( Vot 4S062 ).
Publisher Copyright:
© 2014 Elsevier B.V. All rights reserved.
Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2015
Y1 - 2015
N2 - Software testing is a crucial task during software development process with the potential to save time and budget by recognizing defects as early as possible and delivering a more defect-free product. To improve the testing process, fault prediction approaches identify parts of the system that are more defect prone. However, when the defect data or quality-based class labels are not identified or the company does not have similar or earlier versions of the software project, researchers cannot use supervised classification methods for defect detection. In order to detect defect proneness of modules in software projects with high accuracy and improve detection model generalization ability, we propose an automated software fault detection model using semi-supervised hybrid self-organizing map (HySOM). HySOM is a semi-supervised model based on self-organizing map and artificial neural network. The advantage of HySOM is the ability to predict the label of the modules in a semi-supervised manner using software measurement threshold values in the absence of quality data. In semi-supervised HySOM, the role of expert for identifying fault prone modules becomes less critical and more supportive. We have benchmarked the proposed model with eight industrial data sets from NASA and Turkish white-goods embedded controller software. The results show improvement in false negative rate and overall error rate in 80% and 60% of the cases respectively for NASA data sets. Moreover, we investigate the performance of the proposed model with other recent proposed methods. According to the results, our semi-supervised model can be used as an automated tool to guide testing effort by prioritizing the module's defects improving the quality of software development and software testing in less time and budget.
AB - Software testing is a crucial task during software development process with the potential to save time and budget by recognizing defects as early as possible and delivering a more defect-free product. To improve the testing process, fault prediction approaches identify parts of the system that are more defect prone. However, when the defect data or quality-based class labels are not identified or the company does not have similar or earlier versions of the software project, researchers cannot use supervised classification methods for defect detection. In order to detect defect proneness of modules in software projects with high accuracy and improve detection model generalization ability, we propose an automated software fault detection model using semi-supervised hybrid self-organizing map (HySOM). HySOM is a semi-supervised model based on self-organizing map and artificial neural network. The advantage of HySOM is the ability to predict the label of the modules in a semi-supervised manner using software measurement threshold values in the absence of quality data. In semi-supervised HySOM, the role of expert for identifying fault prone modules becomes less critical and more supportive. We have benchmarked the proposed model with eight industrial data sets from NASA and Turkish white-goods embedded controller software. The results show improvement in false negative rate and overall error rate in 80% and 60% of the cases respectively for NASA data sets. Moreover, we investigate the performance of the proposed model with other recent proposed methods. According to the results, our semi-supervised model can be used as an automated tool to guide testing effort by prioritizing the module's defects improving the quality of software development and software testing in less time and budget.
KW - Artificial neural network
KW - Clustering
KW - Self-organizing maps
KW - Semi-supervised
KW - Software fault prediction
KW - Threshold
UR - http://www.scopus.com/inward/record.url?scp=84926215512&partnerID=8YFLogxK
U2 - 10.1016/j.knosys.2014.10.017
DO - 10.1016/j.knosys.2014.10.017
M3 - Article
AN - SCOPUS:84926215512
VL - 74
SP - 28
EP - 39
JO - Knowledge-Based Systems
JF - Knowledge-Based Systems
SN - 0950-7051
ER -