TY - JOUR
T1 - Better software analytics via “DUO”
T2 - data mining algorithms using/used-by optimizers
AU - Agrawal, Amritanshu
AU - Menzies, Tim
AU - Minku, Leandro L.
AU - Wagner, Markus
AU - Yu, Zhe
N1 - Funding Information:
Earlier work ultimately leading to the present one was inspired by the NII Shonan Meeting on Data-Driven Search-based Software Engineering (goo.gl/f8D3EC), December 11-14, 2017. We thank the organizers of that workshop (Markus Wagner, Leandro L. Minku, Ahmed E. Hassan, and John Clark) for their academic leadership and inspiration. Dr Menzies’ work was partially supported by NSF grant No. 1703487. Dr Minku’s work was partially supported by EPSRC grant Nos. EP/R006660/1 and EP/R006660/2. Dr Wagner’s work was partially supported by the ARC grant DE160100850.
Funding Information:
Earlier work ultimately leading to the present one was inspired by the NII Shonan Meeting on Data-Driven Search-based Software Engineering (goo.gl/f8D3EC), December 11-14, 2017. We thank the organizers of that workshop (Markus Wagner, Leandro L. Minku, Ahmed E. Hassan, and John Clark) for their academic leadership and inspiration. Dr Menzies’ work was partially supported by NSF grant No. 1703487. Dr Minku’s work was partially supported by EPSRC grant Nos. EP/R006660/1 and EP/R006660/2. Dr Wagner’s work was partially supported by the ARC grant DE160100850.
Publisher Copyright:
© 2020, Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2020/12
Y1 - 2020/12
N2 - This paper claims that a new field of empirical software engineering research and practice is emerging: data mining using/used-by optimizers for empirical studies, or DUO. For example, data miners can generate models that are explored by optimizers. Also, optimizers can advise how to best adjust the control parameters of a data miner. This combined approach acts like an agent leaning over the shoulder of an analyst that advises “ask this question next” or “ignore that problem, it is not relevant to your goals”. Further, those agents can help us build “better” predictive models, where “better” can be either greater predictive accuracy or faster modeling time (which, in turn, enables the exploration of a wider range of options). We also caution that the era of papers that just use data miners is coming to an end. Results obtained from an unoptimized data miner can be quickly refuted, just by applying an optimizer to produce a different (and better performing) model. Our conclusion, hence, is that for software analytics it is possible, useful and necessary to combine data mining and optimization using DUO.
AB - This paper claims that a new field of empirical software engineering research and practice is emerging: data mining using/used-by optimizers for empirical studies, or DUO. For example, data miners can generate models that are explored by optimizers. Also, optimizers can advise how to best adjust the control parameters of a data miner. This combined approach acts like an agent leaning over the shoulder of an analyst that advises “ask this question next” or “ignore that problem, it is not relevant to your goals”. Further, those agents can help us build “better” predictive models, where “better” can be either greater predictive accuracy or faster modeling time (which, in turn, enables the exploration of a wider range of options). We also caution that the era of papers that just use data miners is coming to an end. Results obtained from an unoptimized data miner can be quickly refuted, just by applying an optimizer to produce a different (and better performing) model. Our conclusion, hence, is that for software analytics it is possible, useful and necessary to combine data mining and optimization using DUO.
KW - Data mining
KW - Evolutionary algorithms
KW - Optimization
KW - Software analytics
UR - https://www.scopus.com/pages/publications/85084054663
U2 - 10.1007/s10664-020-09808-9
DO - 10.1007/s10664-020-09808-9
M3 - Article
AN - SCOPUS:85084054663
SN - 1382-3256
VL - 25
SP - 2099
EP - 2136
JO - Empirical Software Engineering
JF - Empirical Software Engineering
IS - 3
ER -