An experience report on defect modelling in practice: pitfalls and challenges

Chakkrit Tantithamthavorn, Ahmed E. Hassan

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

77 Citations (Scopus)


Over the past decade with the rise of the Mining Software Repositories (MSR)field, the modelling of defects for large and long-lived systems has become one of the most common applications of MSR. The findings and approaches of such studies have attracted the attention of many of our industrial collaborators (and other practitioners worldwide). At the core of many of these studies is the development and use of analytical models for defects. In this paper, we discuss common pitfalls and challenges that we observed as practitioners attempt to develop such models or reason about the findings of such studies. The key goal of this paper is to document such pitfalls and challenges so practitioners can avoid them in future efforts. We also hope that other academics will be mindful of such pitfalls and challenges in their own work and industrial engagements.

Original languageEnglish
Title of host publicationProceedings - 2018 ACM/IEEE 40th International Conference on Software Engineering: Software Engineering in Practice - ICSE-SEIP 2018
Subtitle of host publication30 May – 1 June 2018 Gothenburg, Sweden
EditorsIvica Crnkovic
Place of PublicationNew York NY USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Number of pages10
ISBN (Electronic)9781450356596
Publication statusPublished - 2018
Externally publishedYes
EventInternational Conference on Software Engineering 2018: Software Engineering in Practice Track - Gothenburg, Sweden
Duration: 27 May 20183 Jun 2018
Conference number: 40th (Proceedings)


ConferenceInternational Conference on Software Engineering 2018
Abbreviated titleICSE-SEIP 2018
OtherTrack from International Conference on Software Engineering 2018
Internet address


  • Defect modelling
  • Empirical software engineering
  • Experimental design
  • Mining software repositories
  • Software analytics

Cite this