Targeted Deep Learning System Boundary Testing
Oliver Wei{\ss}l, Amr Abdellatif, Xingcheng Chen, Giorgi Merabishvili, Vincenzo Riccio, Severin Kacianka, Andrea Stocco

TL;DR
This paper introduces Mimicry, a black-box testing method using style-based GANs to generate targeted, semantically meaningful boundary inputs for deep learning systems, improving reliability assessment.
Contribution
Mimicry is a novel boundary testing approach that leverages style-based GANs for controlled input generation, addressing limitations of prior untargeted methods.
Findings
Mimicry consistently finds inputs closer to decision boundaries.
It generates semantically meaningful boundary cases revealing new behaviors.
Maintains effectiveness with increasing dataset complexity.
Abstract
Evaluating the behavioral boundaries of deep learning (DL) systems is crucial for understanding their reliability across diverse, unseen inputs. Existing solutions fall short as they rely on untargeted random, model- or latent-based perturbations, due to difficulties in generating controlled input variations. In this work, we introduce Mimicry, a novel black-box test generator for fine-grained, targeted exploration of DL system boundaries. Mimicry performs boundary testing by leveraging the probabilistic nature of DL outputs to identify promising directions for exploration. It uses style-based GANs to disentangle input representations into content and style components, enabling controlled feature mixing to approximate the decision boundary. We evaluated Mimicry's effectiveness in generating boundary inputs for five widely used DL image classification systems of increasing complexity,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Image Processing and 3D Reconstruction · Time Series Analysis and Forecasting
