LiDAR: Sensing Linear Probing Performance in Joint Embedding SSL Architectures
Vimal Thilak, Chen Huang, Omid Saremi, Laurent Dinh and, Hanlin Goh, Preetum Nakkiran, Joshua M. Susskind, Etai Littwin

TL;DR
LiDAR is a new metric for evaluating the quality of learned representations in joint embedding SSL architectures, addressing the challenge of assessment without downstream task access, and improving hyperparameter tuning.
Contribution
Introduces LiDAR, a novel metric based on LDA rank, to reliably evaluate and compare representations in joint embedding SSL models.
Findings
LiDAR outperforms existing covariance rank methods in predicting optimal hyperparameters.
LiDAR provides a more robust and intuitive assessment of representation quality.
Empirical results demonstrate LiDAR's effectiveness across different models and datasets.
Abstract
Joint embedding (JE) architectures have emerged as a promising avenue for acquiring transferable data representations. A key obstacle to using JE methods, however, is the inherent challenge of evaluating learned representations without access to a downstream task, and an annotated dataset. Without efficient and reliable evaluation, it is difficult to iterate on architectural and training choices for JE methods. In this paper, we introduce LiDAR (Linear Discriminant Analysis Rank), a metric designed to measure the quality of representations within JE architectures. Our metric addresses several shortcomings of recent approaches based on feature covariance rank by discriminating between informative and uninformative features. In essence, LiDAR quantifies the rank of the Linear Discriminant Analysis (LDA) matrix associated with the surrogate SSL task -- a measure that intuitively captures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
