What’s Spain’s Paris? Mining analogical libraries from Q&A discussions

Chunyang Chen, Zhenchang Xing, Yang Liu

Research output: Contribution to journal › Article › Research › peer-review

Abstract

Third-party libraries are an integral part of many software projects. Developers often need to find analogical libraries that provide features comparable to those of the libraries they already know, but for a different programming language or a different mobile platform. Existing methods to find analogical libraries are limited by community-curated lists of libraries, blogs, or Q&A posts, which often contain overwhelming or out-of-date information. In this paper, we present a new approach to recommend analogical libraries based on a knowledge base of analogical libraries mined from the tags of millions of Stack Overflow questions. The novelty of our approach is to solve analogical-library questions by combining a state-of-the-art word embedding technique with domain-specific relational and categorical knowledge mined from Stack Overflow. Given a library and a recommended analogical library, our approach further extracts question and answer snippets in Stack Overflow that compare analogical libraries, which can potentially offer useful information scents for developers to further their investigation of the recommended analogical libraries. We implemented our approach in a proof-of-concept web application, and more than 34.8 thousand users visited our website from November 2015 to August 2017. Our evaluation shows that our approach can make accurate recommendations of analogical libraries. We also demonstrate the usefulness of our analogical-library recommendations by using them to answer analogical-library questions in Stack Overflow. Google Analytics data on our website traffic and an analysis of visitors’ interaction with website contents provide insights into the usage patterns and the system design of our web application.
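
The analogy in the title ("What’s Spain’s Paris?") is commonly posed with word embeddings as a vector-offset query, and the abstract's idea of combining embeddings with categorical knowledge can be illustrated with a toy sketch. The sketch below is not the authors' implementation: the tag vectors, the `LIBRARY_TAGS` list, and the function `analogical_library` are all hypothetical and invented for illustration. In practice the vectors would be learned from Stack Overflow tags.

```python
import math

# Hypothetical 3-d tag vectors, invented for illustration only. Real embeddings
# would be learned from tag co-occurrence and have many more dimensions.
TAG_VECTORS = {
    "java": [1.0, 0.0, 0.0],
    "python": [0.0, 1.0, 0.0],
    "junit": [1.0, 0.0, 1.0],
    "unittest": [0.0, 1.0, 1.0],
    "numpy": [0.0, 1.0, -1.0],
}

# Hypothetical categorical knowledge: tags known to denote libraries.
LIBRARY_TAGS = ["junit", "numpy", "unittest"]


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def analogical_library(library, source_lang, target_lang):
    """Return the library tag closest to: library - source_lang + target_lang."""
    query = [
        lib - src + tgt
        for lib, src, tgt in zip(
            TAG_VECTORS[library], TAG_VECTORS[source_lang], TAG_VECTORS[target_lang]
        )
    ]
    # Categorical filter: only library tags are candidates, excluding the input.
    candidates = [tag for tag in LIBRARY_TAGS if tag != library]
    return max(candidates, key=lambda tag: cosine(query, TAG_VECTORS[tag]))


print(analogical_library("junit", "java", "python"))
```

With these made-up vectors, the query "junit is to java as ? is to python" selects `unittest`, because the offset vector coincides with that tag's vector. Restricting candidates to known library tags is one simple way to keep language or concept tags out of the ranking.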

Original language: English
Pages (from-to): 1155-1194
Number of pages: 40
Journal: Empirical Software Engineering
Volume: 24
Issue number: 3
DOI: 10.1007/s10664-018-9657-y
Publication status: Published - Jun 2019

Keywords

  • Analogical libraries
  • Categorical knowledge
  • Knowledge graph
  • Relational knowledge
  • Word embedding

Cite this

Chen, Chunyang; Xing, Zhenchang; Liu, Yang. / What’s Spain’s Paris? Mining analogical libraries from Q&A discussions. In: Empirical Software Engineering. 2019; Vol. 24, No. 3. pp. 1155-1194.
@article{dc7b23e262874a9ea556cf896f831799,
title = "What’s {Spain}’s {Paris}? Mining analogical libraries from {Q\&A} discussions",
abstract = "Third-party libraries are an integral part of many software projects. It often happens that developers need to find analogical libraries that can provide comparable features to the libraries they are already familiar with for different programming languages or different mobile platforms. Existing methods to find analogical libraries are limited by the community-curated list of libraries, blogs, or Q&A posts, which often contain overwhelming or out-of-date information. In this paper, we present a new approach to recommend analogical libraries based on a knowledge base of analogical libraries mined from tags of millions of Stack Overflow questions. The novelty of our approach is to solve analogical-library questions by combining state-of-the-art word embedding technique and domain-specific relational and categorical knowledge mined from Stack Overflow. Given a library and a recommended analogical library, our approach further extracts questions and answer snippets in Stack Overflow about comparison of analogical libraries, which can potentially offer useful information scents for developers to further their investigation of the recommended analogical libraries. We implement our approach in a proof-of-concept web application and more than 34.8 thousands of users visited our website from November 2015 to August 2017. Our evaluation shows that our approach can make accurate recommendation of analogical libraries. We also demonstrate the usefulness of our analogical-library recommendations by using them to answer analogical-library questions in Stack Overflow. Google Analytics of our website traffic and analysis of the visitors’ interaction with website contents provide the insights into the usage patterns and the system design of our web application.",
keywords = "Analogical libraries, Categorical knowledge, Knowledge graph, Relational knowledge, Word embedding",
author = "Chunyang Chen and Zhenchang Xing and Yang Liu",
year = "2019",
month = jun,
doi = "10.1007/s10664-018-9657-y",
language = "English",
volume = "24",
pages = "1155--1194",
journal = "Empirical Software Engineering",
issn = "1382-3256",
publisher = "Springer-Verlag London Ltd.",
number = "3"
}

What’s Spain’s Paris? Mining analogical libraries from Q&A discussions. / Chen, Chunyang; Xing, Zhenchang; Liu, Yang.

In: Empirical Software Engineering, Vol. 24, No. 3, 06.2019, p. 1155-1194.

TY - JOUR
T1 - What’s Spain’s Paris? Mining analogical libraries from Q&A discussions
AU - Chen, Chunyang
AU - Xing, Zhenchang
AU - Liu, Yang
PY - 2019/6
Y1 - 2019/6
AB - Third-party libraries are an integral part of many software projects. It often happens that developers need to find analogical libraries that can provide comparable features to the libraries they are already familiar with for different programming languages or different mobile platforms. Existing methods to find analogical libraries are limited by the community-curated list of libraries, blogs, or Q&A posts, which often contain overwhelming or out-of-date information. In this paper, we present a new approach to recommend analogical libraries based on a knowledge base of analogical libraries mined from tags of millions of Stack Overflow questions. The novelty of our approach is to solve analogical-library questions by combining state-of-the-art word embedding technique and domain-specific relational and categorical knowledge mined from Stack Overflow. Given a library and a recommended analogical library, our approach further extracts questions and answer snippets in Stack Overflow about comparison of analogical libraries, which can potentially offer useful information scents for developers to further their investigation of the recommended analogical libraries. We implement our approach in a proof-of-concept web application and more than 34.8 thousands of users visited our website from November 2015 to August 2017. Our evaluation shows that our approach can make accurate recommendation of analogical libraries. We also demonstrate the usefulness of our analogical-library recommendations by using them to answer analogical-library questions in Stack Overflow. Google Analytics of our website traffic and analysis of the visitors’ interaction with website contents provide the insights into the usage patterns and the system design of our web application.

KW - Analogical libraries
KW - Categorical knowledge
KW - Knowledge graph
KW - Relational knowledge
KW - Word embedding
UR - http://www.scopus.com/inward/record.url?scp=85053705232&partnerID=8YFLogxK
U2 - 10.1007/s10664-018-9657-y
DO - 10.1007/s10664-018-9657-y
M3 - Article
VL - 24
SP - 1155
EP - 1194
JO - Empirical Software Engineering
JF - Empirical Software Engineering
SN - 1382-3256
IS - 3
ER -