Uncertainty-Driven Black-Box Test Data Generation
Neil Walkinshaw, Gordon Fraser

TL;DR
This paper introduces an uncertainty-driven approach to generate test data for black-box systems by inferring behavioral models and selecting tests that reduce uncertainty, improving over traditional random testing methods.
Contribution
It applies a machine learning query strategy framework to test data generation, using genetic programming for model inference and uncertainty sampling to target uncertain system behaviors.
Findings
Uncertainty sampling outperforms conventional testing methods.
The approach effectively targets uncertain parts of the system.
Evaluation on real-world systems shows improved test efficiency.
Abstract
We can never be certain that a software system is correct simply by testing it, but with every additional successful test we become less uncertain about its correctness. In absence of source code or elaborate specifications and models, tests are usually generated or chosen randomly. However, rather than randomly choosing tests, it would be preferable to choose those tests that decrease our uncertainty about correctness the most. In order to guide test generation, we apply what is referred to in Machine Learning as "Query Strategy Framework": We infer a behavioural model of the system under test and select those tests which the inferred model is "least certain" about. Running these tests on the system under test thus directly targets those parts about which tests so far have failed to inform the model. We provide an implementation that uses a genetic programming engine for model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Software Engineering Research · Machine Learning and Data Classification
