Constrained Pareto Set Identification with Bandit Feedback

Cyrille Kone; Emilie Kaufmann; Laura Richert

arXiv:2506.08127·stat.ML·June 11, 2025

Constrained Pareto Set Identification with Bandit Feedback

Cyrille Kone, Emilie Kaufmann, Laura Richert

PDF

Open Access

TL;DR

This paper introduces a new algorithm for efficiently identifying the Pareto Set under constraints in a multivariate bandit setting, with theoretical guarantees and empirical validation showing near-optimal performance.

Contribution

The paper proposes a novel fixed-confidence algorithm for constrained Pareto Set identification that outperforms existing methods and matches theoretical lower bounds.

Findings

01

The algorithm significantly reduces sample complexity compared to racing-like algorithms.

02

Theoretical lower bounds demonstrate near-optimality of the proposed approach.

03

Empirical results confirm the effectiveness across benchmark problems.

Abstract

In this paper, we address the problem of identifying the Pareto Set under feasibility constraints in a multivariate bandit setting. Specifically, given a $K$ -armed bandit with unknown means $μ_{1}, \dots, μ_{K} \in R^{d}$ , the goal is to identify the set of arms whose mean is not uniformly worse than that of another arm (i.e., not smaller for all objectives), while satisfying some known set of linear constraints, expressing, for example, some minimal performance on each objective. Our focus lies in fixed-confidence identification, for which we introduce an algorithm that significantly outperforms racing-like algorithms and the intuitive two-stage approach that first identifies feasible arms and then their Pareto Set. We further prove an information-theoretic lower bound on the sample complexity of any algorithm for constrained Pareto Set identification, showing that the sample…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Advanced Control Systems Optimization · Model Reduction and Neural Networks

MethodsFocus · Sparse Evolutionary Training