Improving OOD Generalization of Pre-trained Encoders via Aligned   Embedding-Space Ensembles

Shuman Peng; Arash Khoeini; Sharan Vaswani; Martin Ester

arXiv:2411.13073·cs.LG·November 21, 2024

Improving OOD Generalization of Pre-trained Encoders via Aligned Embedding-Space Ensembles

Shuman Peng, Arash Khoeini, Sharan Vaswani, Martin Ester

PDF

Open Access

TL;DR

This paper introduces a method to improve out-of-distribution generalization of pre-trained encoders by aligning embedding spaces in an ensemble, enhancing their robustness without requiring labels.

Contribution

It presents a novel unsupervised approach to align embedding spaces in ensembles, backed by theoretical analysis and demonstrated on MNIST.

Findings

01

Improved embedding quality on in-distribution data.

02

Enhanced OOD generalization of pre-trained encoders.

03

Effective unsupervised alignment method for embedding spaces.

Abstract

The quality of self-supervised pre-trained embeddings on out-of-distribution (OOD) data is poor without fine-tuning. A straightforward and simple approach to improving the generalization of pre-trained representation to OOD data is the use of deep ensembles. However, obtaining an effective ensemble in the embedding space with only unlabeled data remains an unsolved problem. We first perform a theoretical analysis that reveals the relationship between individual hyperspherical embedding spaces in an ensemble. We then design a principled method to align these embedding spaces in an unsupervised manner. Experimental results on the MNIST dataset show that our embedding-space ensemble method improves pre-trained embedding quality on in-distribution and OOD data compared to single encoders.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Human Pose and Action Recognition · Machine Learning and Data Classification

MethodsALIGN