Learning Robust Representations via Multi-View Information Bottleneck
Marco Federici, Anjan Dutta, Patrick Forr\'e, Nate Kushman, Zeynep, Akata

TL;DR
This paper introduces a multi-view unsupervised information bottleneck method that identifies relevant features shared across views to learn robust representations without labels, achieving state-of-the-art results.
Contribution
It extends the information bottleneck principle to multi-view unsupervised learning, enabling representation learning without labels by focusing on shared information across views.
Findings
Achieves state-of-the-art results on Sketchy and MIR-Flickr datasets.
Demonstrates improved generalization with data augmentation in single-view setting.
Provides theoretical analysis linking multi-view sharing to representation quality.
Abstract
The information bottleneck principle provides an information-theoretic method for representation learning, by training an encoder to retain all information which is relevant for predicting the label while minimizing the amount of other, excess information in the representation. The original formulation, however, requires labeled data to identify the superfluous information. In this work, we extend this ability to the multi-view unsupervised setting, where two views of the same underlying entity are provided but the label is unknown. This enables us to identify superfluous information as that not shared by both views. A theoretical analysis leads to the definition of a new multi-view model that produces state-of-the-art results on the Sketchy dataset and label-limited versions of the MIR-Flickr dataset. We also extend our theory to the single-view setting by taking advantage of standard…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications
