Abstract
Software change-proneness prediction (predicting whether class files in a project will change in the next release) helps developers focus preventive actions to reduce maintenance costs and helps managers allocate resources more effectively. Prior studies found that change-proneness prediction works well when there is a sufficient amount of training data to build a model. However, this is not feasible for projects with limited historical data, especially new projects. To address this issue, cross-project change-proneness prediction has been proposed: it builds a prediction model from the data of another project (the source project) and predicts change-proneness in a target project. Because there are a large number of candidate source projects, one challenge is how to automatically select, for a given target project, a source project that yields good prediction accuracy on it. In this paper, we propose a selective cross-project (SCP) model for change-proneness prediction. SCP automatically finds the source project whose data distribution is similar to that of the target project by measuring the distribution similarity between source and target projects. We evaluate SCP in an empirical study on 14 open source projects and compare it with the two most closely related change-proneness models: RCP (Random Cross-Project prediction), proposed by Malhotra and Bansal, and CLAMI+, developed by Yan et al. Experimental results show that SCP improves on RCP and CLAMI+ by 25.34% and 4.30% in terms of AUC, and by 171.42% and 172.31% in terms of cost-effectiveness, respectively.
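The source-selection idea described in the abstract can be illustrated with a minimal sketch. The abstract does not state which similarity measure SCP uses, so everything here is an assumption made for illustration: each project is summarized by the per-metric mean and standard deviation, the summaries are compared with Euclidean distance, and the function names (`summarize`, `distance`, `select_source`) and the toy data are hypothetical.

```python
from math import sqrt
from statistics import mean, pstdev


def summarize(rows):
    """Flatten per-metric (mean, population std) into one summary vector.

    `rows` is a list of class-level metric vectors for one project.
    This summary is an assumed stand-in for a distribution descriptor,
    not the measure used in the paper.
    """
    summary = []
    for column in zip(*rows):  # iterate over metrics, not classes
        summary.extend([mean(column), pstdev(column)])
    return summary


def distance(a, b):
    """Euclidean distance between two summary vectors of equal length."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_source(target_rows, candidates):
    """Return the name of the candidate whose summary is closest to the target."""
    target_summary = summarize(target_rows)
    return min(
        candidates,
        key=lambda name: distance(summarize(candidates[name]), target_summary),
    )


# Toy, made-up metric data (two metrics per class), for illustration only.
target = [[10.0, 2.0], [12.0, 3.0], [11.0, 2.5]]
candidates = {
    "project_a": [[100.0, 20.0], [120.0, 30.0], [110.0, 25.0]],
    "project_b": [[9.0, 2.0], [13.0, 3.0], [11.0, 2.0]],
}

best = select_source(target, candidates)
print(best)
```

In a full pipeline, a prediction model would then be trained on the selected source project and applied to the target project. In practice the metrics would probably need to be normalized before comparison, since raw scales differ widely across projects.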
Original language | English |
---|---|
Title of host publication | Proceedings - 2018 IEEE 42nd Annual Computer Software and Applications Conference, COMPSAC 2018 |
Subtitle of host publication | 23-27 July 2018 Tokyo, Japan |
Editors | Sorel Reisman, Sheikh Iqbal Ahamed, Claudio Demartini, Thomas Conte, Ling Liu, William Claycomb, Motonori Nakamura, Edmundo Tovar, Stelvio Cimato, Chung-Horng Lung, Hiroki Takakura, Ji-Jiang Yang, Toyokazu Akiyama, Zhiyong Zhang, Kamrul Hasan |
Place of Publication | Piscataway NJ USA |
Publisher | IEEE, Institute of Electrical and Electronics Engineers |
Pages | 64-73 |
Number of pages | 10 |
Volume | 1 |
ISBN (Print) | 9781538626665 |
Publication status | Published - 2018 |
Event | International Computer Software and Applications Conference 2018, Tokyo, Japan. Duration: 23 Jul 2018 → 27 Jul 2018. Conference number: 42nd. Website: https://ieeecompsac.computer.org/2018/. Proceedings: https://ieeexplore.ieee.org/xpl/conhome/8376143/proceeding |
Conference
Conference | International Computer Software and Applications Conference 2018 |
---|---|
Abbreviated title | COMPSAC 2018 |
Country/Territory | Japan |
City | Tokyo |
Period | 23/07/18 → 27/07/18 |
Keywords
- Change-Proneness
- Cross-Project Prediction
- Maintainability
- Project Selection