LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics

Randall Balestriero; Yann LeCun

arXiv:2511.08544·cs.LG·November 17, 2025

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics

Randall Balestriero, Yann LeCun

PDF

Open Access 1 Models

TL;DR

LeJEPA introduces a theoretically grounded, scalable self-supervised learning method that simplifies training by eliminating heuristics and demonstrates strong empirical performance across diverse datasets and architectures.

Contribution

It provides a comprehensive theory for JEPAs, introduces SIGReg for optimal embedding distribution, and offers a practical, stable, and heuristics-free training approach.

Findings

01

Achieves 79% accuracy on ImageNet-1K with ViT-H/14.

02

Demonstrates stability across various architectures and domains.

03

Requires minimal code and hyper-parameter tuning.

Abstract

Learning manipulable representations of the world and its dynamics is central to AI. Joint-Embedding Predictive Architectures (JEPAs) offer a promising blueprint, but lack of practical guidance and theory has led to ad-hoc R&D. We present a comprehensive theory of JEPAs and instantiate it in {\bf LeJEPA}, a lean, scalable, and theoretically grounded training objective. First, we identify the isotropic Gaussian as the optimal distribution that JEPAs' embeddings should follow to minimize downstream prediction risk. Second, we introduce a novel objective--{\bf Sketched Isotropic Gaussian Regularization} (SIGReg)--to constrain embeddings to reach that ideal distribution. Combining the JEPA predictive loss with SIGReg yields LeJEPA with numerous theoretical and practical benefits: (i) single trade-off hyperparameter, (ii) linear time and memory complexity, (iii) stability across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
gajeshladhar/core-jepa
model· ♡ 25
♡ 25

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)