Representation Learning with Contrastive Predictive Coding
Aaron van den Oord, Yazhe Li, Oriol Vinyals

TL;DR
Contrastive Predictive Coding is a universal unsupervised learning method that learns useful representations by predicting future data in latent space across multiple modalities using contrastive loss.
Contribution
It introduces a novel contrastive predictive coding framework that effectively learns representations for diverse data types using future prediction in latent space.
Findings
Achieves strong performance on speech, images, text, and reinforcement learning tasks.
Utilizes a probabilistic contrastive loss with negative sampling for tractable learning.
Demonstrates the method's versatility across multiple domains.
Abstract
While supervised learning has enabled great progress in many applications, unsupervised learning has not seen such widespread adoption, and remains an important and challenging endeavor for artificial intelligence. In this work, we propose a universal unsupervised learning approach to extract useful representations from high-dimensional data, which we call Contrastive Predictive Coding. The key insight of our model is to learn such representations by predicting the future in latent space by using powerful autoregressive models. We use a probabilistic contrastive loss which induces the latent space to capture information that is maximally useful to predict future samples. It also makes the model tractable by using negative sampling. While most prior work has focused on evaluating representations for a particular modality, we demonstrate that our approach is able to learn useful…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗kietnt0603/nrk-legal-smallmodel· ♡ 1♡ 1
- 🤗linhuixiao/Awesome-Visual-Groundingmodel· ♡ 1♡ 1
- 🤗tomaarsen/mpnet-base-gooaq-mnrl-baselinemodel
- 🤗tomaarsen/distilroberta-base-nli-matryoshka-baselinemodel
- 🤗tomaarsen/distilroberta-base-nli-matryoshka-exact-halvingmodel
- 🤗tomaarsen/splade-distilbert-base-uncased-nq-16bsmodel· 1 dl1 dl
- 🤗tomaarsen/splade-distilbert-base-uncased-nq-16bs-2e-6model· 3 dl· ♡ 13 dl♡ 1
- 🤗tomaarsen/splade-distilbert-base-uncased-nq-16bs-6e-6model· 3 dl3 dl
- 🤗tomaarsen/splade-distilbert-base-uncased-nq-16bs-1e-6model· 1 dl1 dl
- 🤗tomaarsen/splade-distilbert-base-uncased-nq-512bs-1e-5model· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Speech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · RMSProp · A2C · Step Decay · SGD with Momentum · Gated Recurrent Unit · PixelCNN · Linear Layer
