Mining analogical libraries in Q&A discussions - incorporating relational and categorical knowledge into word embedding

Chunyang Chen, Sa Gao, Zhenchang Xing

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearch

Abstract

Third-party libraries are an integral part of many software projects. It often happens that developers need to find analogical libraries that can provide comparable features to the libraries they are already familiar with. Existing methods to find analogical libraries are limited by the community-curated list of libraries, blogs, or Q&A posts, which often contain overwhelming or out-of-date information. In this paper, we present a new approach to recommend analogical libraries based on a knowledge base of analogical libraries mined from tags of millions of Stack Overflow questions. The novelty of our approach is to solve analogical-libraries questions by combining state-of-the-art word embedding technique and domain-specific relational and categorical knowledge mined from Stack Overflow. We implement our approach in a proof-of-concept web application (https://graphofknowledge.appspot.com/similartech). The evaluation results show that our approach can make accurate recommendation of analogical libraries (Precision@1=0.81 and Precision@5=0.67). Google Analytics of the website traffic provides initial evidence of the potential usefulness of our web application for software developers.
Original languageEnglish
Title of host publicationProceedings - 2016 IEEE 23rd International Conference on Software Analysis, Evolution, and Reengineering, SANER 2016
Subtitle of host publication14–18 March 2016 Osaka, Japan
EditorsYasutaka Kamei, Michele Lanza
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages338-348
Number of pages11
Volume1
ISBN (Electronic)9781509018550
DOIs
Publication statusPublished - 2016
Externally publishedYes
EventIEEE International Conference on Software Analysis, Evolution, and Reengineering 2016 - Suita, Japan
Duration: 14 Mar 201618 Mar 2016
Conference number: 23rd
http://saner.inf.usi.ch/index.php
https://ieeexplore.ieee.org/xpl/conhome/7476313/proceeding (Proceedings)

Conference

ConferenceIEEE International Conference on Software Analysis, Evolution, and Reengineering 2016
Abbreviated titleSANER 2016
Country/TerritoryJapan
CitySuita
Period14/03/1618/03/16
Internet address

Keywords

  • Analogical libraries
  • Word embedding
  • Knowledge graph
  • Relational knowledge
  • Categorical knowledge

Cite this