The mantra of the PROMISE series is "repeatable, improvable, maybe refutable" software engineering experiments. This community has successfully created a library of reusable software engineering data sets. The next challenge in the PROMISE community will be to not only share data, but to share experiments. Our experience with existing data mining environments is that these tools are not suitable for publishing or sharing repeatable experiments. OURMINE is an environment for the development of data mining experiments. OURMINE offers a succinct notation for describing experiments. Adding new tools to OURMINE, in a variety of languages, is a rapid and simple process. This makes it a useful research tool. Complicated graphical interfaces have been eschewed for simple command-line prompts. This simplifies the learning curve for data mining novices. The simplicity also encourages large scale modification and experimentation with the code. In this paper, we show the OURMINE code required to reproduce a recent experiment checking how defect predictors learned from one site apply to another. This is an important result for the PROMISE community since it shows that our shared repository is not just a useful academic resource. Rather, it is a valuable resource industry: companies that lack the local data required to build those predictors can use PROMISE data to build defect predictors.
|Title of host publication||PROMISE 2009 - International Conference on Predictor Models in Software Engineering|
|Publication status||Published - 1 Dec 2009|
|Event||5th International Conference on Predictor Models in Software Engineering, PROMISE '09 - Vancouver, BC, Canada|
Duration: 18 May 2009 → 19 May 2009
|Conference||5th International Conference on Predictor Models in Software Engineering, PROMISE '09|
|Period||18/05/09 → 19/05/09|