Towards Fundamental Limits for Active Multi-distribution Learning

Chicheng Zhang; Yihan Zhou

arXiv:2506.17607·cs.LG·June 24, 2025

TL;DR

This paper develops new algorithms for active multi-distribution learning and proves improved upper and lower bounds on label complexity, covering both the near-realizable and agnostic settings.

Contribution

It introduces new algorithms for active multi-distribution learning and proves improved label complexity upper and lower bounds in distribution-dependent and distribution-free settings.

Findings

01

Proves an upper bound of $\theta_{\text{max}}(d+k)\ln\frac{1}{\varepsilon}$ in the near-realizable setting.

02

Establishes an upper bound of $\theta_{\text{max}}(d+k)\left(\ln\frac{1}{\varepsilon}+\frac{\nu^2}{\varepsilon^2}\right)+\frac{k\nu}{\varepsilon^2}$ in the agnostic setting.

03

Shows the realizable setting bound is information-theoretically optimal.
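To get a feel for how the two bounds in the findings compare, the following minimal Python sketch evaluates both expressions numerically. It assumes the usual reading of the symbols, which this page does not define: `theta_max` is the largest disagreement coefficient among the `k` distributions, `d` is the VC dimension of the hypothesis class, `nu` is the best achievable worst-case error, and `eps` is the target excess error. Constants and any hidden logarithmic factors are dropped, the function names are hypothetical, and the input values are made up for illustration only.

```python
import math


def realizable_bound(theta_max: float, d: int, k: int, eps: float) -> float:
    """theta_max * (d + k) * ln(1/eps), with constants and log factors dropped."""
    return theta_max * (d + k) * math.log(1.0 / eps)


def agnostic_bound(theta_max: float, d: int, k: int, eps: float, nu: float) -> float:
    """theta_max * (d + k) * (ln(1/eps) + nu^2/eps^2) + k * nu / eps^2."""
    first = theta_max * (d + k) * (math.log(1.0 / eps) + nu**2 / eps**2)
    second = k * nu / eps**2
    return first + second


# Hypothetical values, chosen only to illustrate the scaling.
theta_max, d, k, eps = 2.0, 10, 5, 0.01

print(realizable_bound(theta_max, d, k, eps))
# With nu = 0 the agnostic expression reduces to the realizable one.
print(agnostic_bound(theta_max, d, k, eps, nu=0.0))
# With nu > 0 the nu^2/eps^2 and k*nu/eps^2 terms add to the total.
print(agnostic_bound(theta_max, d, k, eps, nu=0.05))
```

Setting `nu = 0` makes both noise-dependent terms vanish, which is one way to see that the agnostic expression interpolates down to the near-realizable one.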

Abstract

Multi-distribution learning extends agnostic Probably Approximately Correct (PAC) learning to the setting in which a family of $k$ distributions, $\{D_i\}_{i \in [k]}$, is considered and a classifier's performance is measured by its error under the worst distribution. This problem has attracted considerable recent interest due to its applications in collaborative learning, fairness, and robustness. Despite a rather complete picture of the sample complexity of passive multi-distribution learning, research on active multi-distribution learning remains scarce, and the optimality of existing algorithms remains unknown. In this paper, we develop new algorithms for active multi-distribution learning and establish improved label complexity upper and lower bounds, in distribution-dependent and distribution-free settings. Specifically, in the near-realizable setting we prove an upper bound of…
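The worst-distribution objective described in the abstract can be illustrated with a small Python sketch. This is not the paper's algorithm: it only computes the evaluation metric, using finite labeled samples as stand-ins for the $k$ distributions. The toy data, the 1-D threshold classifier, and all function names are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Sample = Tuple[float, int]  # (feature, label) for a toy 1-D problem


def empirical_error(h: Callable[[float], int], data: Sequence[Sample]) -> float:
    """Fraction of examples in `data` that `h` misclassifies."""
    return sum(h(x) != y for x, y in data) / len(data)


def worst_case_error(h: Callable[[float], int], datasets: Sequence[Sequence[Sample]]) -> float:
    """Maximum empirical error of `h` over all datasets (one per distribution)."""
    return max(empirical_error(h, data) for data in datasets)


def threshold(t: float) -> Callable[[float], int]:
    """Classifier that predicts 1 when x >= t and 0 otherwise."""
    return lambda x: int(x >= t)


# Two toy samples standing in for k = 2 distributions.
datasets: List[List[Sample]] = [
    [(0.1, 0), (0.4, 0), (0.6, 1), (0.9, 1)],
    [(0.2, 0), (0.45, 1), (0.55, 1), (0.8, 1)],
]

h = threshold(0.5)
print([empirical_error(h, data) for data in datasets])  # per-distribution errors
print(worst_case_error(h, datasets))                    # the quantity being minimized
```

Here the classifier is perfect on the first sample but errs on one of four points in the second, so its multi-distribution error is determined by the second sample alone.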
