On Discriminative Probabilistic Modeling for Self-Supervised   Representation Learning

Bokun Wang; Yunwen Lei; Yiming Ying; Tianbao Yang

arXiv:2410.09156·cs.LG·March 6, 2025

On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning

Bokun Wang, Yunwen Lei, Yiming Ying, Tianbao Yang

PDF

Open Access 1 Video

TL;DR

This paper introduces a probabilistic modeling framework for self-supervised learning, utilizing multiple importance sampling to improve contrastive loss, and proposes a new non-parametric method that outperforms existing baselines in image-language pretraining.

Contribution

It develops a novel probabilistic approach with a new contrastive objective and efficient algorithm, addressing limitations of current InfoNCE-based methods in self-supervised learning.

Findings

01

Superior performance on CC3M and CC12M datasets

02

Effective Monte Carlo integration via MIS

03

New contrastive objective improves learning quality

Abstract

We study the discriminative probabilistic modeling on a continuous domain for the data prediction task of (multimodal) self-supervised representation learning. To address the challenge of computing the integral in the partition function for each anchor data, we leverage the multiple importance sampling (MIS) technique for robust Monte Carlo integration, which can recover InfoNCE-based contrastive loss as a special case. Within this probabilistic modeling framework, we conduct generalization error analysis to reveal the limitation of current InfoNCE-based contrastive loss for self-supervised representation learning and derive insights for developing better approaches by reducing the error of Monte Carlo integration. To this end, we propose a novel non-parametric method for approximating the sum of conditional probability densities required by MIS through convex optimization, yielding a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On Discriminative Probabilistic Modeling for Self-Supervised Representation Learning· slideslive

Taxonomy

TopicsText and Document Classification Technologies · Machine Learning and Data Classification