Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization
Vikas K. Garg, Adam Kalai, Katrina Ligett, and Zhiwei Steven Wu

TL;DR
This paper introduces a theoretical framework for domain generalization using a meta-distribution over data domains, demonstrating polynomial-sample learnability and practical feature selection improvements.
Contribution
It proposes a simple, unified theoretical model for domain generalization with multiple problem settings and validates the approach with experiments on feature selection.
Findings
Polynomial-sample domain generalization is achievable in various settings.
The feature selection algorithm effectively ignores spurious correlations.
Experiments show improved generalization performance.
Abstract
Domain generalization is the problem of machine learning when the training data and the test data come from different data domains. We present a simple theoretical model of learning to generalize across domains in which there is a meta-distribution over data distributions, and those data distributions may even have different supports. In our model, the training data given to a learning algorithm consists of multiple datasets each from a single domain drawn in turn from the meta-distribution. We study this model in three different problem settings---a multi-domain Massart noise setting, a decision tree multi-dataset setting, and a feature selection setting, and find that computationally efficient, polynomial-sample domain generalization is possible in each. Experiments demonstrate that our feature selection algorithm indeed ignores spurious correlations and improves generalization.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Multimodal Machine Learning Applications
MethodsTest · Feature Selection
