On the sample complexity of semi-supervised multi-objective learning

Tobias Wegel; Geelon So; Junhyung Park; Fanny Yang

arXiv:2508.17152·stat.ML·August 26, 2025

On the sample complexity of semi-supervised multi-objective learning

Tobias Wegel, Geelon So, Junhyung Park, Fanny Yang

PDF

1 Video

TL;DR

This paper investigates the sample complexity in semi-supervised multi-objective learning, revealing conditions where unlabeled data can reduce labeled data needs, especially with Bregman losses.

Contribution

It provides theoretical bounds on sample complexity for MOL, highlighting when unlabeled data helps and introducing a semi-supervised pseudo-labeling algorithm.

Findings

01

Unavoidable complexity cost for some losses even with ideal semi-supervised data.

02

For Bregman losses, unlabeled data can significantly reduce labeled data requirements.

03

Proposed simple semi-supervised algorithm achieves the derived bounds.

Abstract

In multi-objective learning (MOL), several possibly competing prediction tasks must be solved jointly by a single model. Achieving good trade-offs may require a model class $G$ with larger capacity than what is necessary for solving the individual tasks. This, in turn, increases the statistical cost, as reflected in known MOL bounds that depend on the complexity of $G$ . We show that this cost is unavoidable for some losses, even in an idealized semi-supervised setting, where the learner has access to the Bayes-optimal solutions for the individual tasks as well as the marginal distributions over the covariates. On the other hand, for objectives defined with Bregman losses, we prove that the complexity of $G$ may come into play only in terms of unlabeled data. Concretely, we establish sample complexity upper bounds, showing precisely when and how unlabeled…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On the sample complexity of semi-supervised multi-objective learning· slideslive