Learning useful representations for shifting tasks and distributions

Jianyu Zhang; L\'eon Bottou

arXiv:2212.07346·cs.LG·August 2, 2023

Learning useful representations for shifting tasks and distributions

Jianyu Zhang, L\'eon Bottou

PDF

Open Access 1 Repo

TL;DR

This paper argues that representations learned from multiple training episodes with different random seeds are richer and more adaptable to distribution shifts than those learned from a single episode, supported by theoretical insights and experiments.

Contribution

It introduces the idea that concatenating representations from multiple training runs enhances adaptability to new distributions, highlighting the limitations of single-episode learned representations.

Findings

01

Concatenated representations outperform single-run representations on new distributions.

02

Multiple training episodes produce more diverse and informative representations.

03

Single training episodes tend to produce less redundant and less adaptable features.

Abstract

Does the dominant approach to learn representations (as a side effect of optimizing an expected cost for a single training distribution) remain a good approach when we are dealing with multiple distributions? Our thesis is that such scenarios are better served by representations that are richer than those obtained with a single optimization episode. We support this thesis with simple theoretical arguments and with experiments utilizing an apparently na\"{\i}ve ensembling technique: concatenating the representations obtained from multiple training episodes using the same data, model, algorithm, and hyper-parameters, but different random seeds. These independently trained networks perform similarly. Yet, in a number of scenarios involving new distributions, the concatenated representation performs substantially better than an equivalently sized network trained with a single training run.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tjujianyu/rrl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Neural Networks and Applications · Machine Learning and Algorithms