Generalization Analysis for Supervised Contrastive Representation Learning under Non-IID Settings

Nong Minh Hieu; Antoine Ledent

arXiv:2505.04937·stat.ML·May 29, 2025

Generalization Analysis for Supervised Contrastive Representation Learning under Non-IID Settings

Nong Minh Hieu, Antoine Ledent

PDF

Open Access 1 Video

TL;DR

This paper analyzes the generalization behavior of Contrastive Representation Learning (CRL) in non-i.i.d. settings, providing theoretical bounds that better reflect practical scenarios where data is recycled across tuples.

Contribution

It introduces the first generalization bounds for CRL under non-i.i.d. conditions, extending theoretical understanding to more realistic data reuse scenarios.

Findings

01

Generalization bounds scale logarithmically with the class covering number.

02

Sample complexity depends on the number of classes and feature class complexity.

03

Bounds are derived for linear and neural network function classes.

Abstract

Contrastive Representation Learning (CRL) has achieved impressive success in various domains in recent years. Nevertheless, the theoretical understanding of the generalization behavior of CRL has remained limited. Moreover, to the best of our knowledge, the current literature only analyzes generalization bounds under the assumption that the data tuples used for contrastive learning are independently and identically distributed. However, in practice, we are often limited to a fixed pool of reusable labeled data points, making it inevitable to recycle data across tuples to create sufficiently large datasets. Therefore, the tuple-wise independence condition imposed by previous works is invalidated. In this paper, we provide a generalization analysis for the CRL framework under non- $i . i . d .$ settings that adheres to practice more realistically. Drawing inspiration from the literature on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Generalization Analysis for Supervised Contrastive Representation Learning under Non-IID Settings· slideslive

Taxonomy

TopicsFace and Expression Recognition · Domain Adaptation and Few-Shot Learning · Face recognition and analysis

MethodsContrastive Learning