Quantifying Representation Reliability in Self-Supervised Learning   Models

Young-Jin Park; Hao Wang; Shervin Ardeshir; Navid Azizan

arXiv:2306.00206·cs.LG·May 21, 2024·1 cites

Quantifying Representation Reliability in Self-Supervised Learning Models

Young-Jin Park, Hao Wang, Shervin Ardeshir, Navid Azizan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel ensemble-based approach to quantify the reliability of self-supervised learning representations without requiring access to downstream task data, using neighborhood consistency across different representation spaces.

Contribution

The authors propose a new method for estimating representation reliability that does not depend on downstream data, leveraging neighborhood consistency and space alignment techniques.

Findings

01

Method accurately correlates with downstream task performance

02

Outperforms baseline methods in reliability estimation

03

Effective across various self-supervised models

Abstract

Self-supervised learning models extract general-purpose representations from data. Quantifying the reliability of these representations is crucial, as many downstream models rely on them as input for their own tasks. To this end, we introduce a formal definition of representation reliability: the representation for a given test point is considered to be reliable if the downstream models built on top of that representation can consistently generate accurate predictions for that test point. However, accessing downstream data to quantify the representation reliability is often infeasible or restricted due to privacy concerns. We propose an ensemble-based method for estimating the representation reliability without knowing the downstream tasks a priori. Our method is based on the concept of neighborhood consistency across distinct pre-trained representation spaces. The key insight is to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

azizanlab/repreli
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Domain Adaptation and Few-Shot Learning

MethodsTest · ALIGN