Neural Coherence : Find higher performance to out-of-distribution tasks from few samples
Simon Guiroy, Mats Richter, Sarath Chandar, Christopher Pal

TL;DR
This paper introduces Neural Coherence, a data-efficient method for selecting pre-trained models using activation statistics, which improves out-of-distribution task performance with minimal unlabeled target data.
Contribution
It proposes Neural Coherence, a novel approach for model selection based on activation statistics, effective with few unlabeled samples and applicable across various domains and meta-learning scenarios.
Findings
Significantly improves out-of-distribution generalization.
Effective with only a few unlabeled target samples.
Versatile in training data selection.
Abstract
To create state-of-the-art models for many downstream tasks, it has become common practice to fine-tune a pre-trained large vision model. However, it remains an open question of how to best determine which of the many possible model checkpoints resulting from a large training run to use as the starting point. This becomes especially important when data for the target task of interest is scarce, unlabeled and out-of-distribution. In such scenarios, common methods relying on in-distribution validation data become unreliable or inapplicable. This work proposes a novel approach for model selection that operates reliably on just a few unlabeled examples from the target task. Our approach is based on a novel concept: Neural Coherence, which entails characterizing a model's activation statistics for source and target domains, allowing one to define model selection methods with high…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
