Unsupposable Test-data Generation for Machine-learned Software
Naoto Sato, Hironobu Kuruma, and Hideto Ogawa

TL;DR
This paper introduces a novel method called unsupposable test-data generation (UTG) using variational autoencoders to create data that challenges machine learning models, helping developers evaluate model robustness against unexpected inputs.
Contribution
The paper proposes UTG, a new approach leveraging VAEs to generate unsupposable data, enhancing model testing by producing data with rare or unforeseen features.
Findings
UTG successfully generated unsupposable data for MNIST and House Sales Price datasets.
Generated data revealed new features, aiding developers in creating more robust test cases.
Feasibility of UTG demonstrated through experimental results.
Abstract
As for software development by machine learning, a trained model is evaluated by using part of an existing dataset as test data. However, if data with characteristics that differ from the existing data is input, the model does not always behave as expected. Accordingly, to confirm the behavior of the model more strictly, it is necessary to create data that differs from the existing data and test the model with that different data. The data to be tested includes not only data that developers can suppose (supposable data) but also data they cannot suppose (unsupposable data). To confirm the behavior of the model strictly, it is important to create as much unsupposable data as possible. In this study, therefore, a method called "unsupposable test-data generation" (UTG)---for giving suggestions for unsupposable data to model developers and testers---is proposed. UTG uses a variational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Software Testing and Debugging Techniques
MethodsSolana Customer Service Number +1-833-534-1729 · USD Coin Customer Service Number +1-833-534-1729
