Syntactic Vs. Semantic similarity of Artificial and Real Faults in Mutation Testing Studies
Milos Ojdanic, Aayush Garg, Ahmed Khanfir, Renzo Degiovanni, Mike, Papadakis, Yves Le Traon

TL;DR
This study investigates whether syntactic similarity in fault seeding correlates with semantic similarity to real faults, revealing that syntactic similarity does not necessarily imply semantic similarity, and compares the effectiveness of four fault seeding techniques.
Contribution
The paper provides an empirical comparison of four fault seeding techniques, demonstrating that syntactic similarity does not guarantee semantic similarity and evaluating their fault detection capabilities.
Findings
Syntactic similarity does not reflect semantic similarity.
CodeBERT and PiTest have similar fault detection capabilities.
IBIR is the most cost-effective fault seeding technique.
Abstract
Fault seeding is typically used in controlled studies to evaluate and compare test techniques. Central to these techniques lies the hypothesis that artificially seeded faults involve some form of realistic properties and thus provide realistic experimental results. In an attempt to strengthen realism, a recent line of research uses advanced machine learning techniques, such as deep learning and Natural Language Processing (NLP), to seed faults that look like (syntactically) real ones, implying that fault realism is related to syntactic similarity. This raises the question of whether seeding syntactically similar faults indeed results in semantically similar faults and more generally whether syntactically dissimilar faults are far away (semantically) from the real ones. We answer this question by employing 4 fault-seeding techniques (PiTest - a popular mutation testing tool, IBIR - a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Molecular Biology Techniques and Applications
