The impact of changes mislabeled by SZZ on Just-in-Time defect prediction

Yuanrui Fan, Xin Xia, Daniel Alencar da Costa, David Lo, Ahmed E. Hassan, Shanping Li

Research output: Contribution to journal › Article › Research › peer-review

56 Citations (Scopus)

Abstract

Just-in-Time (JIT) defect prediction, a technique that aims to predict bugs at the change level, has attracted growing attention. JIT defect prediction relies on the SZZ approach to identify bug-introducing changes. Recently, researchers found that the performance of SZZ (including its variants) is affected by many sources of noise. SZZ may therefore mislabel a considerable number of the changes used to train a JIT defect prediction model, and thus affect the prediction accuracy.

Original language: English
Pages (from-to): 1559-1586
Number of pages: 26
Journal: IEEE Transactions on Software Engineering
Volume: 47
Issue number: 8
DOIs
Publication status: Published - 1 Aug 2021

Keywords

  • Just-in-Time Defect Prediction
  • Mining Software Repositories
  • Noisy Data
  • SZZ
