On identifying and explaining similarities in Android apps

Li Li, Tegawendé F. Bissyandé, Hao-Yu Wang, Jacques Klein

Research output: Contribution to journal › Article › Research › peer-review

Abstract

App updates and repackaging are recurrent in the Android ecosystem, filling markets with similar apps that must be identified. Despite the existence of several approaches to improving the scalability of detecting repackaged/cloned apps, researchers and practitioners are eventually faced with the need for a comprehensive pairwise comparison (or simultaneously multiple app comparisons) to understand and validate the similarities among apps. In this work, we present the design and implementation of our research-based prototype tool called SimiDroid for multi-level similarity comparison of Android apps. SimiDroid is built with the aim to support the comprehension of similarities/changes among app versions and among repackaged apps. In particular, we demonstrate the need and usefulness of such a framework based on different case studies implementing different dissection scenarios for revealing various insights on how repackaged apps are built. We further show that the similarity comparison plugins implemented in SimiDroid yield more accurate results than the state of the art.

Original language: English
Pages (from-to): 437-455
Number of pages: 19
Journal: Journal of Computer Science and Technology
Volume: 34
Issue number: 2
DOI: 10.1007/s11390-019-1918-8
Publication status: Published - Mar 2019

Keywords

  • Android
  • app clone
  • similarity analysis

Cite this

Li, Li ; Bissyandé, Tegawendé F. ; Wang, Hao-Yu ; Klein, Jacques. / On identifying and explaining similarities in Android apps. In: Journal of Computer Science and Technology. 2019 ; Vol. 34, No. 2. pp. 437-455.
@article{175bdab22d174728af532a035c1c0366,
title = "On identifying and explaining similarities in Android apps",
abstract = "App updates and repackaging are recurrent in the Android ecosystem, filling markets with similar apps that must be identified. Despite the existence of several approaches to improving the scalability of detecting repackaged/cloned apps, researchers and practitioners are eventually faced with the need for a comprehensive pairwise comparison (or simultaneously multiple app comparisons) to understand and validate the similarities among apps. In this work, we present the design and implementation of our research-based prototype tool called SimiDroid for multi-level similarity comparison of Android apps. SimiDroid is built with the aim to support the comprehension of similarities/changes among app versions and among repackaged apps. In particular, we demonstrate the need and usefulness of such a framework based on different case studies implementing different dissection scenarios for revealing various insights on how repackaged apps are built. We further show that the similarity comparison plugins implemented in SimiDroid yield more accurate results than the state of the art.",
keywords = "Android, app clone, similarity analysis",
author = "Li Li and Bissyand{\'e}, {Tegawend{\'e} F.} and Hao-Yu Wang and Jacques Klein",
year = "2019",
month = "3",
doi = "10.1007/s11390-019-1918-8",
language = "English",
volume = "34",
pages = "437--455",
journal = "Journal of Computer Science and Technology",
issn = "1000-9000",
publisher = "Springer-Verlag London Ltd.",
number = "2",

}

TY - JOUR
T1 - On identifying and explaining similarities in Android apps
AU - Li, Li
AU - Bissyandé, Tegawendé F.
AU - Wang, Hao-Yu
AU - Klein, Jacques
PY - 2019/3
Y1 - 2019/3

AB - App updates and repackaging are recurrent in the Android ecosystem, filling markets with similar apps that must be identified. Despite the existence of several approaches to improving the scalability of detecting repackaged/cloned apps, researchers and practitioners are eventually faced with the need for a comprehensive pairwise comparison (or simultaneously multiple app comparisons) to understand and validate the similarities among apps. In this work, we present the design and implementation of our research-based prototype tool called SimiDroid for multi-level similarity comparison of Android apps. SimiDroid is built with the aim to support the comprehension of similarities/changes among app versions and among repackaged apps. In particular, we demonstrate the need and usefulness of such a framework based on different case studies implementing different dissection scenarios for revealing various insights on how repackaged apps are built. We further show that the similarity comparison plugins implemented in SimiDroid yield more accurate results than the state of the art.

KW - Android
KW - app clone
KW - similarity analysis
UR - http://www.scopus.com/inward/record.url?scp=85063627262&partnerID=8YFLogxK
U2 - 10.1007/s11390-019-1918-8
DO - 10.1007/s11390-019-1918-8
M3 - Article
VL - 34
SP - 437
EP - 455
JO - Journal of Computer Science and Technology
JF - Journal of Computer Science and Technology
SN - 1000-9000
IS - 2
ER -