Abstract
Developers often look for solutions to programming problems on community Q&A sites such as Stack Overflow. Because these sites are crowdsourced, many user-provided answers are wrong, suboptimal, or out of date. Community-curated quality indicators (e.g., accepted answers, answer votes) cannot reliably identify these problems. Such problematic answers are often criticized by other users, but the critiques are not readily discoverable when reading the posts. In this paper, we treat criticized answers and their critique posts as controversial discussions in community Q&A sites. To help developers notice such controversial discussions and make more informed choices of appropriate solutions, we design an automatic open information extraction approach that systematically discovers and summarizes the controversies in Stack Overflow and exploits official API documentation to aid understanding of the discovered controversies. We apply our approach to millions of Java/Android-tagged Stack Overflow questions and answers and discover a large number of controversial discussions. Our manual evaluation confirms that the extracted controversy information is highly accurate. A user study with 18 developers demonstrates that our generated controversy summaries help developers avoid controversial answers and choose more appropriate solutions to programming questions.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2019 34th IEEE/ACM International Conference on Automated Software Engineering, ASE 2019 |
| Editors | Julia Lawall, Darko Marinov |
| Place of Publication | Piscataway NJ USA |
| Publisher | IEEE, Institute of Electrical and Electronics Engineers |
| Pages | 151-162 |
| Number of pages | 12 |
| ISBN (Electronic) | 9781728125084 |
| ISBN (Print) | 9781728125091 |
| Publication status | Published - 2019 |
| Event | Automated Software Engineering Conference 2019 - San Diego, United States of America; Duration: 10 Nov 2019 → 15 Nov 2019; Conference number: 34th; Conference website: https://2019.ase-conferences.org/; Proceedings: https://dl.acm.org/doi/proceedings/10.5555/3382508 |
Conference
| Conference | Automated Software Engineering Conference 2019 |
|---|---|
| Abbreviated title | ASE 2019 |
| Country/Territory | United States of America |
| City | San Diego |
| Period | 10/11/19 → 15/11/19 |
Keywords
- Controversial discussion
- Open information extraction
- Sentence embedding
- Stack Overflow