Show Me What You Don't Know: Efficient Sampling from Invariant Sets for Model Validation

Armand Rousselot; Joran Wendebourg; Ullrich K\"othe

arXiv:2603.21782·cs.LG·March 24, 2026

Show Me What You Don't Know: Efficient Sampling from Invariant Sets for Model Validation

Armand Rousselot, Joran Wendebourg, Ullrich K\"othe

PDF

Open Access

TL;DR

This paper introduces a training-free, diffusion-based method to sample from invariant feature sets of models, enabling visualization and analysis of model invariances without extensive training.

Contribution

It proposes a novel, training-free approach using pretrained diffusion models to efficiently sample from invariance fibers for model validation.

Findings

01

Reveals invariances in models like ResNet, DINO, BiomedClip

02

Identifies concerning invariances such as in medical imaging

03

Demonstrates efficiency over traditional invariance learning methods

Abstract

The performance of machine learning models is determined by the quality of their learned features. They should be invariant under irrelevant data variation but sensitive to task-relevant details. To visualize whether this is the case, we propose a method to analyze feature extractors by sampling from their fibers -- equivalence classes defined by their invariances -- given an arbitrary representative. Unlike existing work where a dedicated generative model is trained for each feature detector, our algorithm is training-free and exploits a pretrained diffusion or flow-matching model as a prior. The fiber loss -- which penalizes mismatch in features -- guides the denoising process toward the desired equivalence class, via non-linear diffusion trajectory matching. This replaces days of training for invariance learning with a single guided generation procedure at comparable fidelity.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFetal and Pediatric Neurological Disorders · Generative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications