Prioritization of Metamorphic Relations to reduce the cost of testing
Madhusudan Srinivasan, Upulee Kanewala

TL;DR
This paper introduces a data diversity-based method for prioritizing metamorphic relations in testing machine learning programs, significantly improving fault detection efficiency and reducing testing costs compared to traditional approaches.
Contribution
It proposes a novel prioritization approach based on data diversity, addressing limitations of code coverage methods for machine learning testing.
Findings
Increases fault detection effectiveness by up to 40%.
Reduces time to detect faults by 29%.
Saves testing time and cost.
Abstract
An oracle is a mechanism to decide whether the outputs of the program for the executed test cases are correct. For machine learning programs, such oracle is not available or too difficult to apply. Metamorphic testing is a testing approach that uses metamorphic relations, which are necessary properties of the software under test to help verify the correctness of a program. Prioritization of metamorphic relations helps to reduce the cost of metamorphic testing [1]. However, prioritizing metamorphic relations based on code coverage is often not effective for prioritizing MRs for machine learning programs, since the decision logic of a machine learning model is learned from training data, and 100% code coverage can be easily achieved with a single test input. To this end, in this work, we propose a cost-effective approach based on diversity in the source and follow-up data set to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Testing and Debugging Techniques · Adversarial Robustness in Machine Learning · Software Engineering Research
