Quantifying Representation Reliability in Self-Supervised Learning Models
Young-Jin Park, Hao Wang, Shervin Ardeshir, Navid Azizan

TL;DR
This paper introduces a novel ensemble-based approach to quantify the reliability of self-supervised learning representations without requiring access to downstream task data, using neighborhood consistency across different representation spaces.
Contribution
The authors propose a new method for estimating representation reliability that does not depend on downstream data, leveraging neighborhood consistency and space alignment techniques.
Findings
Method accurately correlates with downstream task performance
Outperforms baseline methods in reliability estimation
Effective across various self-supervised models
Abstract
Self-supervised learning models extract general-purpose representations from data. Quantifying the reliability of these representations is crucial, as many downstream models rely on them as input for their own tasks. To this end, we introduce a formal definition of representation reliability: the representation for a given test point is considered to be reliable if the downstream models built on top of that representation can consistently generate accurate predictions for that test point. However, accessing downstream data to quantify the representation reliability is often infeasible or restricted due to privacy concerns. We propose an ensemble-based method for estimating the representation reliability without knowing the downstream tasks a priori. Our method is based on the concept of neighborhood consistency across distinct pre-trained representation spaces. The key insight is to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning
MethodsTest · ALIGN
