Unshuffling Data for Improved Generalization
Damien Teney, Ehsan Abbasnejad, Anton van den Hengel

TL;DR
This paper introduces a method that partitions training data into environments to improve out-of-distribution generalization in neural networks, especially for tasks like visual question answering.
Contribution
It proposes a novel training procedure that leverages environment partitioning to learn stable patterns and reduce spurious correlations, surpassing traditional correlation-based methods.
Findings
Significant improvements on VQA-CP dataset.
Enhanced generalization on GQA with question annotations.
Better multi-dataset training by environment separation.
Abstract
Generalization beyond the training distribution is a core challenge in machine learning. The common practice of mixing and shuffling examples when training neural networks may not be optimal in this regard. We show that partitioning the data into well-chosen, non-i.i.d. subsets treated as multiple training environments can guide the learning of models with better out-of-distribution generalization. We describe a training procedure to capture the patterns that are stable across environments while discarding spurious ones. The method makes a step beyond correlation-based learning: the choice of the partitioning allows injecting information about the task that cannot be otherwise recovered from the joint distribution of the training data. We demonstrate multiple use cases with the task of visual question answering, which is notorious for dataset biases. We obtain significant improvements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
