Projects per year
Abstract
Just-In-Time (JIT) defect prediction (i.e., an AI/ML model to predict defect-introducing commits) is proposed to help developers prioritize their limited Software Quality Assurance (SQA) resources on the most risky commits. However, the explainability of JIT defect models remains largely unexplored (i.e., practitioners still do not know why a commit is predicted as defect-introducing). Recently, LIME has been used to generate explanations for any AI/ML models. However, the random perturbation approach used by LIME to generate synthetic neighbors is still suboptimal, i.e., generating synthetic neighbors that may not be similar to an instance to be explained, producing low accuracy of the local models, leading to inaccurate explanations for just-in-time defect models. In this paper, we propose PyExplainer - i.e., a local rule-based model-agnostic technique for generating explanations (i.e., why a commit is predicted as defective) of JIT defect models. Through a case study of two open-source software projects, we find that our PyExplainer produces (1) synthetic neighbors that are 41%-45% more similar to an instance to be explained; (2) 18%-38% more accurate local models; and (3) explanations that are 69%-98% more unique and 17%-54% more consistent with the actual characteristics of defect-introducing commits in the future than LIME (a state-ofthe-art model-agnostic technique). This could help practitioners focus on the most important aspects of the commits to mitigate the risk of being defect-introducing. Thus, the contributions of this paper build an important step towards Explainable AI for Software Engineering, making software analytics more explainable and actionable. Finally, we publish our PyExplainer as a Python package to support practitioners and researchers (https://github.com/awsm-research/PyExplainer).
Original language | English |
---|---|
Title of host publication | Proceedings - 2021 36th IEEE/ACM International Conference on Automated Software Engineering, ASE 2021 |
Editors | Dan Hao, Denys Poshyvanyk |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 407-418 |
Number of pages | 12 |
ISBN (Electronic) | 9781665403375 |
ISBN (Print) | 9781665447843 |
DOIs | |
Publication status | Published - 2021 |
Event | International Conference on Automated Software Engineering 2021 - Online, Australia Duration: 15 Nov 2021 → 19 Nov 2021 Conference number: 36th https://ieeexplore.ieee.org/xpl/conhome/9678507/proceeding (Website) |
Publication series
Name | Proceedings - 2021 36th IEEE/ACM International Conference on Automated Software Engineering, ASE 2021 |
---|---|
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
ISSN (Print) | 1938-4300 |
ISSN (Electronic) | 2643-1572 |
Conference
Conference | International Conference on Automated Software Engineering 2021 |
---|---|
Abbreviated title | ASE 2021 |
Country/Territory | Australia |
Period | 15/11/21 → 19/11/21 |
Internet address |
Keywords
- Explainable AI
- Explainable Software Analytics
- Just-In-Time Defect Prediction
Projects
- 1 Finished
-
Practical and Explainable Analytics to Prevent Future Software Defects
Australian Research Council (ARC)
2/03/20 → 1/03/23
Project: Research