Searching for test data with feature diversity
Robert Feldt, Simon Poulding

TL;DR
This paper explores methods for generating diverse test data based on specific features, demonstrating that search-based approaches like hill climbing can efficiently produce varied inputs tailored to testing needs.
Contribution
It introduces feature-based diversity measures for test data generation and compares search strategies, highlighting the effectiveness of hill climbing in achieving targeted diversity.
Findings
Hill climbing search is efficient for feature diversity.
Feature-based measures offer more control than generic information-theoretic methods.
Trade-offs exist between different search approaches for diverse test data.
Abstract
There is an implicit assumption in software testing that more diverse and varied test data is needed for effective testing and to achieve different types and levels of coverage. Generic approaches based on information theory to measure and thus, implicitly, to create diverse data have also been proposed. However, if the tester is able to identify features of the test data that are important for the particular domain or context in which the testing is being performed, the use of generic diversity measures such as this may not be sufficient nor efficient for creating test inputs that show diversity in terms of these features. Here we investigate different approaches to find data that are diverse according to a specific set of features, such as length, depth of recursion etc. Even though these features will be less general than measures based on information theory, their use may provide a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Software Reliability and Analysis Research
