Generalizing to any diverse distribution: uniformity, gentle finetuning and rebalancing
Andreas Loukas, Karolis Martinkus, Ed Wagstaff, Kyunghyun Cho

TL;DR
This paper investigates how to achieve robust model generalization across diverse test distributions by analyzing uniform training and rebalancing techniques, supported by theoretical insights and empirical validation.
Contribution
It introduces a conservative framework for out-of-distribution generalization, proving uniform training as optimal and analyzing rebalancing methods for practical scenarios.
Findings
Training on a uniform distribution is optimal for diverse test distributions.
Rebalancing and finetuning can mitigate non-uniformity effects.
Theoretical grounding for entropy and rebalancing in o.o.d. generalization.
Abstract
As training datasets grow larger, we aspire to develop models that generalize well to any diverse test distribution, even if the latter deviates significantly from the training data. Various approaches like domain adaptation, domain generalization, and robust optimization attempt to address the out-of-distribution challenge by posing assumptions about the relation between training and test distribution. Differently, we adopt a more conservative perspective by accounting for the worst-case error across all sufficiently diverse test distributions within a known domain. Our first finding is that training on a uniform distribution over this domain is optimal. We also interrogate practical remedies when uniform samples are unavailable by considering methods for mitigating non-uniformity through finetuning and rebalancing. Our theory provides a mathematical grounding for previous observations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsForecasting Techniques and Applications · Statistical Distribution Estimation and Applications
