Towards Efficient Contrastive PAC Learning

Jie Shen

arXiv:2502.15962·cs.LG·July 8, 2025

Towards Efficient Contrastive PAC Learning

Jie Shen

PDF

Open Access

TL;DR

This paper investigates the PAC learning framework for contrastive learning of linear representations, establishing intractability results, proposing a semi-definite relaxation, and providing the first efficient PAC learning algorithm with generalization guarantees.

Contribution

It introduces the first efficient PAC learning algorithm for contrastive learning of linear representations, including theoretical analysis and relaxation techniques.

Findings

01

Contrastive PAC learning of linear representations is generally intractable.

02

A semi-definite program relaxation is effective when using the $\,\ell_2$-norm.

03

The proposed algorithm has proven generalization guarantees.

Abstract

We study contrastive learning under the PAC learning framework. While a series of recent works have shown statistical results for learning under contrastive loss, based either on the VC-dimension or Rademacher complexity, their algorithms are inherently inefficient or not implying PAC guarantees. In this paper, we consider contrastive learning of the fundamental concept of linear representations. Surprisingly, even under such basic setting, the existence of efficient PAC learners is largely open. We first show that the problem of contrastive PAC learning of linear representations is intractable to solve in general. We then show that it can be relaxed to a semi-definite program when the distance between contrastive samples is measured by the $ℓ_{2}$ -norm. We then establish generalization guarantees based on Rademacher complexity, and connect it to PAC guarantees under certain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Speech Recognition and Synthesis · Handwritten Text Recognition Techniques

MethodsContrastive Learning