Learn to Expect the Unexpected: Probably Approximately Correct Domain   Generalization

Vikas K. Garg; Adam Kalai; Katrina Ligett; and Zhiwei Steven Wu

arXiv:2002.05660·cs.LG·February 14, 2020·6 cites

Learn to Expect the Unexpected: Probably Approximately Correct Domain Generalization

Vikas K. Garg, Adam Kalai, Katrina Ligett, and Zhiwei Steven Wu

PDF

Open Access

TL;DR

This paper introduces a theoretical framework for domain generalization using a meta-distribution over data domains, demonstrating polynomial-sample learnability and practical feature selection improvements.

Contribution

It proposes a simple, unified theoretical model for domain generalization with multiple problem settings and validates the approach with experiments on feature selection.

Findings

01

Polynomial-sample domain generalization is achievable in various settings.

02

The feature selection algorithm effectively ignores spurious correlations.

03

Experiments show improved generalization performance.

Abstract

Domain generalization is the problem of machine learning when the training data and the test data come from different data domains. We present a simple theoretical model of learning to generalize across domains in which there is a meta-distribution over data distributions, and those data distributions may even have different supports. In our model, the training data given to a learning algorithm consists of multiple datasets each from a single domain drawn in turn from the meta-distribution. We study this model in three different problem settings---a multi-domain Massart noise setting, a decision tree multi-dataset setting, and a feature selection setting, and find that computationally efficient, polynomial-sample domain generalization is possible in each. Experiments demonstrate that our feature selection algorithm indeed ignores spurious correlations and improves generalization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Multimodal Machine Learning Applications

MethodsTest · Feature Selection