PAC-Bayesian Contrastive Unsupervised Representation Learning

Kento Nozawa; Pascal Germain; Benjamin Guedj

arXiv:1910.04464·cs.LG·July 20, 2020·5 cites

PAC-Bayesian Contrastive Unsupervised Representation Learning

Kento Nozawa, Pascal Germain, Benjamin Guedj

PDF

Open Access 1 Repo

TL;DR

This paper extends theoretical understanding of contrastive unsupervised representation learning (CURL) by deriving PAC-Bayesian generalisation bounds, leading to a new algorithm that performs competitively on real datasets.

Contribution

It introduces PAC-Bayesian generalisation bounds for CURL and develops a new representation learning algorithm based on these bounds.

Findings

01

The new algorithm achieves competitive accuracy on real datasets.

02

The bounds are non-vacuous and provide meaningful generalisation guarantees.

03

Empirical results support the theoretical analysis.

Abstract

Contrastive unsupervised representation learning (CURL) is the state-of-the-art technique to learn representations (as a set of features) from unlabelled data. While CURL has collected several empirical successes recently, theoretical understanding of its performance was still missing. In a recent work, Arora et al. (2019) provide the first generalisation bounds for CURL, relying on a Rademacher complexity. We extend their framework to the flexible PAC-Bayes setting, allowing us to deal with the non-iid setting. We present PAC-Bayesian generalisation bounds for CURL, which are then used to derive a new representation learning algorithm. Numerical experiments on real-life datasets illustrate that our algorithm achieves competitive accuracy, and yields non-vacuous generalisation bounds.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nzw0301/pb-contrastive
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Algorithms · Topic Modeling