MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations
Benedikt Alkin, Lukas Miklautz, Sepp Hochreiter, Johannes, Brandstetter

TL;DR
MIM-Refiner enhances pre-trained Masked Image Modeling models by using intermediate layer contrastive learning, significantly improving their performance on various downstream tasks with a simple and efficient refinement process.
Contribution
The paper introduces MIM-Refiner, a novel contrastive learning method leveraging intermediate layers of MIM models to boost their representations for downstream tasks.
Findings
Refines MIM features to state-of-the-art within a few epochs.
Achieves 84.7% linear probing accuracy on ImageNet-1K.
Outperforms previous SSL models on multiple benchmarks.
Abstract
We introduce MIM (Masked Image Modeling)-Refiner, a contrastive learning boost for pre-trained MIM models. MIM-Refiner is motivated by the insight that strong representations within MIM models generally reside in intermediate layers. Accordingly, MIM-Refiner leverages multiple contrastive heads that are connected to different intermediate layers. In each head, a modified nearest neighbor objective constructs semantic clusters that capture semantic information which improves performance on downstream tasks, including off-the-shelf and fine-tuning settings. The refinement process is short and simple - yet highly effective. Within a few epochs, we refine the features of MIM models from subpar to state-of-the-art, off-the-shelf features. Refining a ViT-H, pre-trained with data2vec 2.0 on ImageNet-1K, sets a new state-of-the-art in linear probing (84.7%) and low-shot classification among…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Neural Networks and Applications · Machine Learning and Algorithms
MethodsContrastive Learning · Mutual Information Machine/Mask Image Modeling
