Combining word embedding with information retrieval to recommend similar bug reports

Xinli Yang, David Lo, Xin Xia, Lingfeng Bao, Jianling Sun

Research output: Chapter in Book/Report/Conference proceeding; Conference Paper; Research; peer-review

73 Citations (Scopus)


Similar bugs are bugs that require handling of many common code files. Developers can often fix similar bugs in less time and with higher quality, since they can focus on fewer code files. Similar bug recommendation is therefore a meaningful task that can improve development efficiency. Rocha et al. propose the first similar bug recommendation system, named NextBug. Although NextBug performs better than REP, a state-of-the-art duplicate bug detection technique, its performance is not optimal, and more work is needed to improve its effectiveness. Technically, it is also rather simple, as it relies only on a standard information retrieval technique, i.e., cosine similarity. In this paper, we propose a novel approach to recommend similar bugs. The approach combines a traditional information retrieval technique with a word embedding technique, and takes bug titles and descriptions as well as bug product and component information into consideration. To evaluate the approach, we use datasets from two popular open-source projects, Eclipse and Mozilla, each of which contains bug reports whose bug IDs range from 1 to 400,000. The results show that our approach improves on the performance of NextBug by a statistically significant and substantial margin for both projects.
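The abstract names the ingredients of the approach (an information retrieval similarity, a word embedding similarity, and product and component information) but not how they are combined. The following is a minimal sketch of one plausible combination, not the authors' implementation. It assumes TF-IDF cosine similarity for the information retrieval part, averaged word vectors for the embedding part, an exact-match score for product and component, and an unweighted sum of the three. The function names, the report fields, and the two-dimensional toy embeddings are all hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a simplifying assumption)."""
    return text.lower().split()


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


def tfidf_vectors(docs):
    """Build one TF-IDF vector per tokenized document (smoothed IDF)."""
    n = len(docs)
    df = Counter(t for d in docs for t in set(d))
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    return [{t: tf * idf[t] for t, tf in Counter(d).items()} for d in docs]


def mean_embedding(tokens, emb):
    """Average the vectors of the tokens that have an embedding."""
    vecs = [emb[t] for t in tokens if t in emb]
    if not vecs:
        return {}
    dim = len(vecs[0])
    return {i: sum(v[i] for v in vecs) / len(vecs) for i in range(dim)}


def recommend(reports, query_index, emb, k=5):
    """Return the ids of the k reports most similar to the query report.

    Score = TF-IDF cosine + embedding cosine + product/component match.
    The unweighted sum is an illustrative assumption.
    """
    docs = [tokenize(r["text"]) for r in reports]
    tfidf = tfidf_vectors(docs)
    means = [mean_embedding(d, emb) for d in docs]
    query = reports[query_index]
    scored = []
    for j, cand in enumerate(reports):
        if j == query_index:
            continue
        ir_sim = cosine(tfidf[query_index], tfidf[j])
        emb_sim = cosine(means[query_index], means[j])
        meta_sim = 0.5 * (query["product"] == cand["product"]) \
            + 0.5 * (query["component"] == cand["component"])
        scored.append((ir_sim + emb_sim + meta_sim, cand["id"]))
    scored.sort(key=lambda pair: -pair[0])
    return [bug_id for _, bug_id in scored[:k]]


# Toy data: made-up reports and made-up 2-dimensional embeddings.
toy_reports = [
    {"id": 1, "text": "editor crashes when saving file",
     "product": "Platform", "component": "UI"},
    {"id": 2, "text": "editor crash on file save",
     "product": "Platform", "component": "UI"},
    {"id": 3, "text": "debugger ignores breakpoint",
     "product": "JDT", "component": "Debug"},
]
toy_emb = {
    "editor": [1.0, 0.0], "file": [1.0, 0.1],
    "crashes": [0.9, 0.1], "crash": [0.9, 0.1],
    "saving": [0.8, 0.2], "save": [0.8, 0.2],
    "debugger": [0.0, 1.0], "ignores": [0.0, 0.9],
    "breakpoint": [0.1, 1.0],
}

print(recommend(toy_reports, 0, toy_emb, k=2))
```

In this toy setting, report 2 shares only two exact terms with report 1 ("editor" and "file"), so the TF-IDF score alone is modest. The embedding term is what captures that "crashes" and "crash", or "saving" and "save", are related, which illustrates why an embedding signal could complement plain term matching.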

Original language: English
Title of host publication: Proceedings - 2016 IEEE 27th International Symposium on Software Reliability Engineering, ISSRE 2016
Subtitle of host publication: 23-27 October 2016 - Ottawa, Ontario, Canada
Editors: Alexander Romanovsky, Elena Troubitsyna
Place of Publication: Piscataway NJ USA
Publisher: IEEE, Institute of Electrical and Electronics Engineers
Number of pages: 11
ISBN (Electronic): 9781467390019, 9781467390033
Publication status: Published - 2016
Externally published: Yes
Event: International Symposium on Software Reliability Engineering 2016 - Ottawa, Canada
Duration: 23 Oct 2016 to 27 Oct 2016
Conference number: 27th


Conference: International Symposium on Software Reliability Engineering 2016
Abbreviated title: ISSRE 2016


  • Information Retrieval
  • Recommendation Systems
  • Similar Bugs
  • Word Embedding
