Ensemble of Averages: Improving Model Selection and Boosting Performance in Domain Generalization
Devansh Arpit, Huan Wang, Yingbo Zhou, Caiming Xiong

TL;DR
This paper introduces an ensemble of moving average models to improve domain generalization, reducing stochasticity effects and boosting performance on benchmark datasets.
Contribution
It proposes a simple averaging protocol and ensembling moving averages, backed by theoretical analysis, to enhance model robustness and performance in domain generalization.
Findings
Ensembling moving average models significantly improves domain generalization.
The method outperforms vanilla ERM on DomainBed benchmark.
Averaging reduces the impact of stochasticity in training.
Abstract
In Domain Generalization (DG) settings, models trained independently on a given set of training domains have notoriously chaotic performance on distribution shifted test domains, and stochasticity in optimization (e.g. seed) plays a big role. This makes deep learning models unreliable in real world settings. We first show that this chaotic behavior exists even along the training optimization trajectory of a single model, and propose a simple model averaging protocol that both significantly boosts domain generalization and diminishes the impact of stochasticity by improving the rank correlation between the in-domain validation accuracy and out-domain test accuracy, which is crucial for reliable early stopping. Taking advantage of our observation, we show that instead of ensembling unaveraged models (that is typical in practice), ensembling moving average models (EoA) from independent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research
MethodsTest · 1x1 Convolution · Residual Connection · Batch Normalization · *Communicated@Fast*How Do I Communicate to Expedia? · Average Pooling · Max Pooling · Global Average Pooling · Bottleneck Residual Block · Residual Block
