Vector Optimization with Stochastic Bandit Feedback

Çağın Ararat; Cem Tekin

arXiv:2110.12311·cs.LG·March 9, 2023

TL;DR

This paper extends vector optimization to stochastic bandit feedback, characterizing sample complexity for identifying approximate Pareto sets using a new cone-dependent complexity measure, with theoretical bounds and experimental validation.

Contribution

The paper introduces a general framework for vector optimization with bandit feedback, defines the ordering complexity of a polyhedral ordering cone, and derives sample complexity bounds for Pareto set identification.

Findings

1. In the worst case, sample complexity scales with the square of the ordering complexity.

2. The naïve elimination algorithm nearly matches the worst-case sample complexity.

3. Experiments confirm the theoretical predictions and show the impact of parameters.

Abstract

We introduce vector optimization problems with stochastic bandit feedback, in which preferences among designs are encoded by a polyhedral ordering cone $C$. Our setup generalizes the best arm identification problem to vector-valued rewards by extending the concept of Pareto set beyond multi-objective optimization. We characterize the sample complexity of $(\epsilon, \delta)$-PAC Pareto set identification by defining a new cone-dependent notion of complexity, called the ordering complexity. In particular, we provide gap-dependent and worst-case lower bounds on the sample complexity and show that, in the worst case, the sample complexity scales with the square of the ordering complexity. Furthermore, we investigate the sample complexity of the naïve elimination algorithm and prove that it nearly matches the worst-case sample complexity. Finally, we run experiments to verify our theoretical…
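To make the cone-based ordering concrete, here is a minimal Python sketch. It is not the authors' code and rests on several assumptions: the cone is written as $C = \{x : Wx \ge 0\}$ for a hypothetical matrix `W`, design $j$ is taken to dominate design $i$ when $\mu_j - \mu_i$ is a nonzero element of $C$, and `uniform_sampling_estimate` is only a rough stand-in for the naïve elimination idea (sample every design equally often with assumed Gaussian noise, then return the empirical Pareto set). The sample budget `n_samples` is a free parameter here, not the value analyzed in the paper, and all function names are illustrative.

```python
import random

def in_cone(W, x, tol=1e-12):
    """True if W x >= 0 row by row, i.e. x lies in C = {x : W x >= 0}."""
    return all(sum(w * v for w, v in zip(row, x)) >= -tol for row in W)

def pareto_set(means, W, tol=1e-12):
    """Indices of designs that no other design dominates under cone C.

    Assumed convention: j dominates i when mu_j - mu_i is nonzero and in C.
    """
    keep = []
    for i, mu_i in enumerate(means):
        dominated = False
        for j, mu_j in enumerate(means):
            diff = [b - a for a, b in zip(mu_i, mu_j)]
            distinct = any(abs(d) > tol for d in diff)
            if j != i and distinct and in_cone(W, diff, tol):
                dominated = True
                break
        if not dominated:
            keep.append(i)
    return keep

def uniform_sampling_estimate(means, W, n_samples, noise_std, rng):
    """Sample every design n_samples times; return the empirical Pareto set.

    Gaussian noise and the fixed budget are assumptions of this sketch.
    """
    estimates = []
    for mu in means:
        draws = [[rng.gauss(m, noise_std) for m in mu] for _ in range(n_samples)]
        estimates.append([sum(col) / n_samples for col in zip(*draws)])
    return pareto_set(estimates, W)

# Hypothetical mean reward vectors for four designs in two objectives.
means = [[1.0, 0.0], [0.0, 1.0], [0.4, 0.4], [0.9, -0.2]]
W_orthant = [[1.0, 0.0], [0.0, 1.0]]  # positive orthant: componentwise order
W_wide = [[1.0, 0.8], [0.8, 1.0]]     # wider cone: more pairs become comparable

print(pareto_set(means, W_orthant))   # [0, 1, 2]
print(pareto_set(means, W_wide))      # [0, 1]
print(uniform_sampling_estimate(means, W_wide, 200, 0.1, random.Random(0)))
```

With the positive orthant, the ordering reduces to the usual multi-objective one and design 2 stays in the Pareto set. Under the wider cone, design 0 dominates design 2, so the Pareto set shrinks. This illustrates why the choice of cone changes the identification problem.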

Taxonomy

Topics: Advanced Bandit Algorithms Research · Advanced Multi-Objective Optimization Algorithms · Gaussian Processes and Bayesian Inference