Skip to main navigation Skip to search Skip to main content

Toward a Better Understanding of Probabilistic Delta Debugging

  • Mengxiao Zhang
  • , Zhenyang Xu
  • , Yongqiang Tian
  • , Xinru Cheng
  • , Chengnian Sun

Research output: Chapter in Book/Report/Conference proceedingConference PaperResearchpeer-review

Abstract

Given a list L of elements and a property ψ that L exhibits, ddmin is a classic test input minimization algorithm that aims to automatically remove ψ-irrelevant elements from L. This algorithm has been widely adopted in domains such as test input minimization and software debloating. Recently, ProbDD, a variant of ddmin, has been proposed and achieved state-of-the-art performance. By employing Bayesian optimization, ProbDD estimates the probability of each element in L being relevant to ψ, and statistically decides which and how many elements should be deleted together each time. However, the theoretical probabilistic model of ProbDD is rather intricate, and the underlying details for the superior performance of ProbDD have not been adequately explored. In this paper, we conduct the first in-depth theoretical analysis of ProbDD, clarifying the trends in probability and subset size changes and simplifying the probability model. We complement this analysis with empirical experiments, including success rate analysis, ablation studies, and examinations of trade-offs and limitations, to further comprehend and demystify this state-of-the-art algorithm. Our success rate analysis reveals how ProbDD effectively addresses bottlenecks that slow down ddmin by skipping inefficient queries that attempt to delete complements of subsets and previously tried subsets. The ablation study illustrates that randomness in ProbDD has no significant impact on efficiency. These findings provide valuable insights for future research and applications of test input minimization algorithms. Based on the findings above, we propose CDD, a simplified version of ProbDD, reducing the complexity in both theory and implementation. CDD assists in 1 validating the correctness of our key findings, e.g., that probabilities in ProbDD essentially serve as monotonically increasing counters for each element, and 2 identifying the main factors that truly contribute to ProbDD's superior performance. Our comprehensive evaluations across 76 benchmarks in test input minimization and software debloating demonstrate that CDD can achieve the same performance as ProbDD, despite being much simplified.

Original languageEnglish
Title of host publicationProceedings - 2025 IEEE/ACM 47th International Conference on Software Engineering, ICSE 2025
EditorsOpeyemi Adesina, Majid Babaei
Place of PublicationPiscataway NJ USA
PublisherIEEE, Institute of Electrical and Electronics Engineers
Pages2024-2035
Number of pages12
ISBN (Electronic)9798331505691
ISBN (Print)9798331505707
DOIs
Publication statusPublished - 2025
Externally publishedYes
EventInternational Conference on Software Engineering 2025 - Ottawa, Canada
Duration: 27 Apr 20253 May 2025
Conference number: 47th
https://conf.researchr.org/home/icse-2025
https://conf.researchr.org/info/icse-2025/proceedings
https://www.computer.org/csdl/proceedings/icse/2025/215aVCUIQmY

Publication series

NameProceedings - International Conference on Software Engineering
PublisherIEEE, Institute of Electrical and Electronics Engineers
ISSN (Print)0270-5257
ISSN (Electronic)1558-1225

Conference

ConferenceInternational Conference on Software Engineering 2025
Abbreviated titleICSE 2025
Country/TerritoryCanada
CityOttawa
Period27/04/253/05/25
Internet address

Keywords

  • Delta Debugging
  • Program Reduction
  • Software Debloating
  • Test Input Minimization

Cite this