TL;DR
This paper introduces a framework for adding augmentation invariance to pretrained networks using lightweight adapters trained with novel loss functions, achieving high accuracy on rotated and noisy images without fine-tuning the original model.
Contribution
The work proposes a new post-training method for augmentation invariance using probabilistic encoders and introduces two effective loss functions, demonstrating significant improvements in invariance tasks.
Findings
Achieves 94% accuracy on rotated STL10 images with frozen pretrained features.
Boosts noise-invariant classification from 58% to 86%.
Adapter networks trained with proposed losses preserve original feature space integrity.
Abstract
This work develops a framework for post-training augmentation invariance, in which our goal is to add invariance properties to a pretrained network without altering its behavior on the original, non-augmented input distribution. We define this notion precisely and additionally introduce augmented encoders, which are probabilistic encoders that formalize augmentation-based encoding processes and that serve as our fundamental object of study. We introduce two losses for augmented encoders, namely, Markov-Wasserstein minimization and Wasserstein correlation maximization, and we demonstrate empirically that both losses can be used to train lightweight, one-hidden-layer MLP adapter networks E_theta that, when appended to the latent space of a pretrained network F, do indeed lead to (approximate) post-training augmentation invariance. For example, on STL10 with F = DINOv2 features, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
