
TL;DR
VJE introduces a probabilistic, reconstruction-free framework for self-supervised learning that models embeddings with a Student-t distribution, providing uncertainty estimates and strong out-of-distribution detection.
Contribution
It presents a novel variational approach with a heavy-tailed distribution and shared uncertainty modeling, advancing non-contrastive self-supervised learning.
Findings
VJE achieves competitive accuracy on ImageNet-1K, CIFAR-10/100, STL-10.
Representation likelihoods enable effective out-of-distribution detection.
The framework models structured feature uncertainty directly in embedding space.
Abstract
We introduce Variational Joint Embedding (VJE), a reconstruction-free latent-variable framework for non-contrastive self-supervised learning in representation space. VJE maximizes a symmetric conditional evidence lower bound (ELBO) on paired encoder embeddings by defining a conditional likelihood directly on target representations, rather than optimizing a pointwise compatibility objective. The likelihood is instantiated as a heavy-tailed Student--\(t\) distribution on a polar representation of the target embedding, where a directional--radial decomposition separates angular agreement from magnitude consistency and mitigates norm-induced pathologies. The directional factor operates on the unit sphere, yielding a valid variational bound for the associated spherical subdensity model. An amortized inference network parameterizes a diagonal Gaussian posterior whose feature-wise variances…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
