Online Nonconvex Bilevel Optimization with Bregman Divergences

Jason Bohne; David Rosenberg; Gary Kazantsev; and Pawel Polak

arXiv:2409.10470·math.OC·September 17, 2024

Online Nonconvex Bilevel Optimization with Bregman Divergences

Jason Bohne, David Rosenberg, Gary Kazantsev, and Pawel Polak

PDF

Open Access

TL;DR

This paper introduces novel online bilevel optimization algorithms using Bregman divergences, achieving improved regret rates and efficiency for dynamic machine learning tasks like hyperparameter tuning and meta-learning.

Contribution

It presents the first stochastic online bilevel optimizer with variance reduction and a deterministic Bregman-based method that adapts to problem geometry, advancing online bilevel optimization.

Findings

01

OBBO improves sublinear regret rates with hypergradient error decomposition.

02

SOBBO achieves sublinear regret and reduces variance without extra gradient samples.

03

Algorithms outperform existing online and offline bilevel methods in experiments.

Abstract

Bilevel optimization methods are increasingly relevant within machine learning, especially for tasks such as hyperparameter optimization and meta-learning. Compared to the offline setting, online bilevel optimization (OBO) offers a more dynamic framework by accommodating time-varying functions and sequentially arriving data. This study addresses the online nonconvex-strongly convex bilevel optimization problem. In deterministic settings, we introduce a novel online Bregman bilevel optimizer (OBBO) that utilizes adaptive Bregman divergences. We demonstrate that OBBO enhances the known sublinear rates for bilevel local regret through a novel hypergradient error decomposition that adapts to the underlying geometry of the problem. In stochastic contexts, we introduce the first stochastic online bilevel optimizer (SOBBO), which employs a window averaging method for updating outer-level…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Variational Analysis · Risk and Portfolio Optimization · Stochastic processes and financial applications