Skip to main navigation Skip to search Skip to main content

An experimental comparison of clone detection techniques using Java Bytecode

Jean Guy Schneider, Sung Une Lee

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

It is generally accepted in Software Engineering that code clones - often the result of copy-and-paste of existing code - result in poorer maintainability of software systems. Consequently, a variety of techniques have been devised to detect cloned code in software systems and alert developers of duplicated code. Most techniques operate at the source-code level and require some combination of pretty-printing, tokenization and abstraction in order to improve the comparison of code fragments over purely string-based techniques. Avoiding some of the issues of source-code based approaches, we are investigating the effectiveness of using various similarity measures on Bytecode to identify code clones in Java-based systems in this work. The results of our evaluation on selected Java systems indicate that instruction sequences can be used to effectively detect identical code clones. Especially, we achieved the best performance when using the normalized edit distance among applied similarity measures.

Original languageEnglish
Title of host publicationProceedings - 2022 29th Asia-Pacific Software Engineering Conference, APSEC 2022
EditorsEunjong Choi, DongGyun Han
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages139-148
Number of pages10
ISBN (Electronic)9781665455374
DOIs
Publication statusPublished - 2022
EventAsia-Pacific Software Engineering Conference 2022 - Online, Japan
Duration: 6 Dec 20229 Dec 2022
Conference number: 29th
https://ieeexplore.ieee.org/xpl/conhome/10043104/proceeding (Proceedings)
https://conf.researchr.org/home/apsec-2022 (Website)

Publication series

NameProceedings - Asia-Pacific Software Engineering Conference, APSEC
PublisherIEEE, Institute of Electrical and Electronics Engineers
Volume2022-December
ISSN (Print)1530-1362
ISSN (Electronic)2640-0715

Conference

ConferenceAsia-Pacific Software Engineering Conference 2022
Abbreviated titleAPSEC 2022
Country/TerritoryJapan
Period6/12/229/12/22
Internet address

Keywords

  • Bytecode
  • code clone
  • Java
  • sequence alignment
  • similarity distance

Cite this