Mutant reduction evaluation: what is there and what is missing?
Peng Zhang, Yang Wang, Xutong Liu, Yanhui Li, Yibao Yang, Ziyuan Wang, Xiaoyu Zhou, Lin Chen, Yuming Zhou

TL;DR
This paper introduces new evaluation indicators, OP and EROP, to measure how well mutation reduction strategies preserve the order of fault detection potential, addressing a gap in existing assessment methods.
Contribution
The paper proposes novel indicators, OP and EROP, for assessing the order-preserving ability of mutation reduction strategies, enhancing evaluation accuracy.
Findings
OP and EROP effectively measure order-preserving ability.
They outperform existing indicators in distinguishing strategies.
They provide efficient and meaningful evaluation of mutant reduction strategies.
Abstract
Background. Many mutation reduction strategies, which aim to reduce the number of mutants, have been proposed. Problem. It is important to measure the ability of a mutation reduction strategy to maintain test suite effectiveness evaluation. However, existing evaluation indicators are unable to measure the "order-preserving ability". Objective. We aim to propose evaluation indicators to measure the "order-preserving ability" of a mutation reduction strategy, which is important but missing in our community. Method. Given a test suite on a Software Under Test (SUT) with a set of original mutants, we leverage the test suite to generate a group of test suites that have a partial order relationship in fault detecting potential. When evaluating a reduction strategy, we first construct two partial order relationships among the generated test suites in terms of mutation score, one with the…
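Illustrative sketch
The abstract is cut off before OP and EROP are defined, so the following Python sketch is not the paper's indicators. It only illustrates the general idea described in the Method: build test suites whose fault-detecting potential is ordered, then check whether a reduced mutant set ranks them the same way the original mutant set does. All names (`kill_matrix`, `nested_suites`, `order_agreement`) and the toy data are hypothetical, and using nested prefixes of one suite is just one assumed way to obtain suites with a known order.

```python
from itertools import combinations

def mutation_score(suite, kill_matrix, mutants):
    """Fraction of `mutants` killed by at least one test in `suite`."""
    if not mutants:
        return 0.0
    killed = sum(1 for m in mutants if any(m in kill_matrix[t] for t in suite))
    return killed / len(mutants)

def nested_suites(tests):
    """Prefix chain of suites: each suite contains the previous one,
    so it can never kill fewer mutants (assumed construction)."""
    return [tests[:k] for k in range(1, len(tests) + 1)]

def order_agreement(suites, kill_matrix, original, reduced):
    """Share of suite pairs that are strictly ordered by mutation score on
    the original mutants and keep the same strict order on the reduced set.
    This is a hypothetical stand-in, not the paper's OP or EROP."""
    ordered = 0
    kept = 0
    for a, b in combinations(suites, 2):
        sa = mutation_score(a, kill_matrix, original)
        sb = mutation_score(b, kill_matrix, original)
        if sa == sb:
            continue  # no strict order to preserve for this pair
        ordered += 1
        ra = mutation_score(a, kill_matrix, reduced)
        rb = mutation_score(b, kill_matrix, reduced)
        if ra != rb and (sa < sb) == (ra < rb):
            kept += 1
    return kept / ordered if ordered else 1.0

# Toy data (made up): which mutants each test kills.
kill_matrix = {
    "t1": {"m1"},
    "t2": {"m2", "m3"},
    "t3": {"m4"},
}
original = ["m1", "m2", "m3", "m4"]
suites = nested_suites(["t1", "t2", "t3"])

# Two hypothetical reduced mutant sets of different quality.
reduced_lossy = ["m1", "m4"]        # no longer separates the first two suites
reduced_good = ["m1", "m2", "m4"]   # keeps every strict order

agreement_lossy = order_agreement(suites, kill_matrix, original, reduced_lossy)
agreement_good = order_agreement(suites, kill_matrix, original, reduced_good)
print(agreement_lossy, agreement_good)
```

In this toy case both reduced sets are smaller than the original, but only `reduced_good` ranks all three suites in the same order as the full mutant set, which is the kind of difference an order-preserving indicator is meant to expose.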
Taxonomy
Topics: Software Testing and Debugging Techniques · Software Engineering Research · Software Reliability and Analysis Research
