A comparative study for estimating software development effort intervals

Ayşe Bakir, Burak Turhan, Ayşe Bener

Research output: Contribution to journalArticleResearchpeer-review

11 Citations (Scopus)

Abstract

Software cost/effort estimation is still an open challenge. Many researchers have proposed various methods that usually focus on point estimates. Until today, software cost estimation has been treated as a regression problem. However, in order to prevent overestimates and underestimates, it is more practical to predict the interval of estimations instead of the exact values. In this paper, we propose an approach that converts cost estimation into a classification problem and that classifies new software projects in one of the effort classes, each of which corresponds to an effort interval. Our approach integrates cluster analysis with classification methods. Cluster analysis is used to determine effort intervals while different classification algorithms are used to find corresponding effort classes. The proposed approach is applied to seven public datasets. Our experimental results show that the hit rate obtained for effort estimation are around 90-100%, which is much higher than that obtained by related studies. Furthermore, in terms of point estimation, our results are comparable to those in the literature although a simple mean/median is used for estimation. Finally, the dynamic generation of effort intervals is the most distinctive part of our study, and it results in time and effort gain for project managers through the removal of human intervention.

Original languageEnglish
Pages (from-to)537-552
Number of pages16
JournalSoftware Quality Journal
Volume19
Issue number3
DOIs
Publication statusPublished - 1 Jan 2011
Externally publishedYes

Keywords

  • Classification
  • Cluster analysis
  • Interval prediction
  • Machine learning
  • Software effort estimation

Cite this