Optimal Scalarizations for Sublinear Hypervolume Regret

Qiuyi Zhang (Richard)

arXiv:2307.03288·cs.LG·November 6, 2024

Optimal Scalarizations for Sublinear Hypervolume Regret

Qiuyi Zhang (Richard)

PDF

Open Access

TL;DR

This paper introduces non-linear hypervolume scalarizations with random weights for multiobjective optimization, achieving optimal regret bounds and outperforming linear scalarizations in empirical tests.

Contribution

It demonstrates that hypervolume scalarizations with random weights achieve optimal sublinear regret and provides a novel non-Euclidean analysis for multiobjective stochastic linear bandits.

Findings

01

Hypervolume scalarizations with random weights achieve $O(T^{-1/k})$ regret.

02

The proposed approach outperforms linear scalarizations and standard algorithms empirically.

03

Theoretical bounds match lower bounds, confirming optimality.

Abstract

Scalarization is a general, parallizable technique that can be deployed in any multiobjective setting to reduce multiple objectives into one, yet some have dismissed this versatile approach because linear scalarizations cannot explore concave regions of the Pareto frontier. To that end, we aim to find simple non-linear scalarizations that provably explore a diverse set of $k$ objectives on the Pareto frontier, as measured by the dominated hypervolume. We show that hypervolume scalarizations with uniformly random weights achieves an optimal sublinear hypervolume regret bound of $O (T^{- 1/ k})$ , with matching lower bounds that preclude any algorithm from doing better asymptotically. For the setting of multiobjective stochastic linear bandits, we utilize properties of hypervolume scalarizations to derive a novel non-Euclidean analysis to get regret bounds of $\tilde{O}( d T^{-1/2} +…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Risk and Portfolio Optimization

MethodsALIGN