Unshuffling Data for Improved Generalization

Damien Teney; Ehsan Abbasnejad; Anton van den Hengel

arXiv:2002.11894·cs.CV·November 24, 2020·41 cites

Unshuffling Data for Improved Generalization

Damien Teney, Ehsan Abbasnejad, Anton van den Hengel

PDF

Open Access

TL;DR

This paper introduces a method that partitions training data into environments to improve out-of-distribution generalization in neural networks, especially for tasks like visual question answering.

Contribution

It proposes a novel training procedure that leverages environment partitioning to learn stable patterns and reduce spurious correlations, surpassing traditional correlation-based methods.

Findings

01

Significant improvements on VQA-CP dataset.

02

Enhanced generalization on GQA with question annotations.

03

Better multi-dataset training by environment separation.

Abstract

Generalization beyond the training distribution is a core challenge in machine learning. The common practice of mixing and shuffling examples when training neural networks may not be optimal in this regard. We show that partitioning the data into well-chosen, non-i.i.d. subsets treated as multiple training environments can guide the learning of models with better out-of-distribution generalization. We describe a training procedure to capture the patterns that are stable across environments while discarding spurious ones. The method makes a step beyond correlation-based learning: the choice of the partitioning allows injecting information about the task that cannot be otherwise recovered from the joint distribution of the training data. We demonstrate multiple use cases with the task of visual question answering, which is notorious for dataset biases. We obtain significant improvements…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques