Abstract
Defect prediction is an important topic in software quality research. Statistical models for defect prediction can be built on project repositories. Project repositories store software metrics and defect information. This information is then matched with software modules. Naive Bayes is a well known, simple statistical technique that assumes the 'independence' and 'equal importance' of features, which are not true in many problems. However, Naive Bayes achieves high performances on a wide spectrum of prediction problems. This paper addresses the 'equal importance' of features assumption of Naive Bayes. We propose that by means of heuristics we can assign weights to features according to their importance and improve defect prediction performance. We compare the weighted Naive Bayes and the standard Naive Bayes predictors' performances on publicly available datasets. Our experimental results indicate that assigning weights to software metrics increases the prediction performance significantly.
Original language | English |
---|---|
Pages | 244-249 |
Number of pages | 6 |
Publication status | Published - 1 Dec 2007 |
Externally published | Yes |
Event | International Conference on Software and Data Technologies 2007 - Barcelona, Spain Duration: 22 Jul 2007 → 25 Jul 2007 Conference number: 2nd https://link.springer.com/book/10.1007/978-3-540-88655-6 (Proceedings) |
Conference
Conference | International Conference on Software and Data Technologies 2007 |
---|---|
Abbreviated title | ICSOFT 2007 |
Country/Territory | Spain |
City | Barcelona |
Period | 22/07/07 → 25/07/07 |
Internet address |
|
Keywords
- Defect prediction
- Empirical software engineering
- Feature weighting
- Naïve bayes
- Software metrics
- Software quality