Optimal Scalarizations for Sublinear Hypervolume Regret
Qiuyi Zhang (Richard)

TL;DR
This paper introduces non-linear hypervolume scalarizations with random weights for multiobjective optimization, achieving optimal regret bounds and outperforming linear scalarizations in empirical tests.
Contribution
It demonstrates that hypervolume scalarizations with random weights achieve optimal sublinear regret and provides a novel non-Euclidean analysis for multiobjective stochastic linear bandits.
Findings
Hypervolume scalarizations with random weights achieve $O(T^{-1/k})$ regret.
The proposed approach outperforms linear scalarizations and standard algorithms empirically.
Theoretical bounds match lower bounds, confirming optimality.
Abstract
Scalarization is a general, parallizable technique that can be deployed in any multiobjective setting to reduce multiple objectives into one, yet some have dismissed this versatile approach because linear scalarizations cannot explore concave regions of the Pareto frontier. To that end, we aim to find simple non-linear scalarizations that provably explore a diverse set of objectives on the Pareto frontier, as measured by the dominated hypervolume. We show that hypervolume scalarizations with uniformly random weights achieves an optimal sublinear hypervolume regret bound of , with matching lower bounds that preclude any algorithm from doing better asymptotically. For the setting of multiobjective stochastic linear bandits, we utilize properties of hypervolume scalarizations to derive a novel non-Euclidean analysis to get regret bounds of $\tilde{O}( d T^{-1/2} +…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Risk and Portfolio Optimization
MethodsALIGN
