A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees

Miao Zhang; Junpeng Li; Changchun Hua; Yana Yang

arXiv:2511.22823·cs.LG·December 1, 2025

A Unified and Stable Risk Minimization Framework for Weakly Supervised Learning with Theoretical Guarantees

Miao Zhang, Junpeng Li, Changchun Hua, Yana Yang

PDF

Open Access

TL;DR

This paper introduces a unified, stable risk minimization framework for various weakly supervised learning settings, providing theoretical guarantees and demonstrating consistent empirical improvements without heuristic stabilization.

Contribution

It proposes a single optimization-based framework that subsumes multiple weak supervision types and offers theoretical analysis including generalization bounds and robustness to class-prior misspecification.

Findings

01

Consistent empirical gains across datasets and supervision types.

02

Robustness to class-prior misspecification and overfitting.

03

Theoretical guarantees including generalization bounds and identifiability conditions.

Abstract

Weakly supervised learning has emerged as a practical alternative to fully supervised learning when complete and accurate labels are costly or infeasible to acquire. However, many existing methods are tailored to specific supervision patterns -- such as positive-unlabeled (PU), unlabeled-unlabeled (UU), complementary-label (CLL), partial-label (PLL), or similarity-unlabeled annotations -- and rely on post-hoc corrections to mitigate instability induced by indirect supervision. We propose a principled, unified framework that bypasses such post-hoc adjustments by directly formulating a stable surrogate risk grounded in the structure of weakly supervised data. The formulation naturally subsumes diverse settings -- including PU, UU, CLL, PLL, multi-class unlabeled, and tuple-based learning -- under a single optimization objective. We further establish a non-asymptotic generalization bound…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Imbalanced Data Classification Techniques