From Alignment to Prediction: A Study of Self-Supervised Learning and Predictive Representation Learning

Mintu Dutta; Ritesh Vyas; and Mohendra Roy

arXiv:2604.13518·cs.LG·April 16, 2026

From Alignment to Prediction: A Study of Self-Supervised Learning and Predictive Representation Learning

Mintu Dutta, Ritesh Vyas, and Mohendra Roy

PDF

TL;DR

This paper introduces Predictive Representation Learning (PRL) as a new paradigm in self-supervised learning, emphasizing latent prediction of unobserved data components, and compares it with existing alignment and reconstruction methods.

Contribution

It defines PRL, proposes a taxonomy including PRL, alignment, and reconstruction, and analyzes architectures like I-JEPA, providing theoretical insights and empirical comparisons.

Findings

01

MAE achieves perfect similarity of 1.00

02

BYOL and I-JEPA attain high accuracy of 0.98 and 0.95

03

I-JEPA shows strong robustness scores of 0.78

Abstract

Self-supervised learning has emerged as a major technique for the task of learning from unlabeled data, where the current methods mostly revolve around alignment of representations and input recon struction. Although such approaches have demonstrated excellent performance in practice, their scope remains mostly confined to learning from observed data and does not provide much help in terms of a learning structure that is predictive of the data distribution. In this paper, we study some of the recent developments in the realm of self-supervised learning. We define a new category called Predictive Representation Learning (PRL), which revolves around the latent prediction of unobserved components of data based on the observation. We propose a common taxonomy that classifies PRL along with alignment and reconstruction-based learning approaches. Furthermore, we argue that Joint-Embedding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.