Abstract
To improve software engineering, software repositories have been mined for code snippets and bug fixes. Typically, this mining takes place at the level of files or commits. To be able to dig deeper and to extract insights at a higher resolution, we hereby present an annotated dataset that contains over 7 million edits of code and text on Stack Overflow. Our preliminary study indicates that these edits might be a treasure trove for mining information about fine-grained patches, e.g., for the optimisation of non-functional properties.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion |
Editors | Carlos A. Coello Coello |
Place of Publication | New York NY USA |
Publisher | Association for Computing Machinery (ACM) |
Pages | 1923-1925 |
Number of pages | 3 |
ISBN (Electronic) | 9781450371278 |
Publication status | Published - 2020 |
Externally published | Yes |
Event | The Genetic and Evolutionary Computation Conference 2020, Cancun, Mexico. Duration: 8 Jul 2020 → 12 Jul 2020. Conference number: 22nd. https://gecco-2020.sigevo.org/index.html/HomePage; https://dl.acm.org/doi/proceedings/10.1145/3377930 (Proceedings) |
Conference
Conference | The Genetic and Evolutionary Computation Conference 2020 |
---|---|
Abbreviated title | GECCO 2020 |
Country/Territory | Mexico |
City | Cancun |
Period | 8/07/20 → 12/07/20 |
Keywords
- Mining software repositories
- Patches
- Software documentation
- Software evolution
- Stack Overflow