A Fully Single Loop Algorithm for Bilevel Optimization without Hessian Inverse
Junyi Li, Bin Gu, Heng Huang

TL;DR
This paper introduces a novel fully single loop bilevel optimization algorithm that avoids Hessian inverse computations, improving efficiency while maintaining convergence guarantees, and demonstrates its effectiveness on machine learning tasks.
Contribution
The paper develops the first fully single loop bilevel optimization algorithm that does not require Hessian inverse, with theoretical convergence analysis and empirical validation.
Findings
Converges with rate O(ε^{-2})
Effectively handles hyper-gradient approximation
Outperforms double loop methods in experiments
Abstract
In this paper, we propose a new Hessian inverse free Fully Single Loop Algorithm (FSLA) for bilevel optimization problems. Classic algorithms for bilevel optimization admit a double loop structure which is computationally expensive. Recently, several single loop algorithms have been proposed with optimizing the inner and outer variable alternatively. However, these algorithms not yet achieve fully single loop. As they overlook the loop needed to evaluate the hyper-gradient for a given inner and outer state. In order to develop a fully single loop algorithm, we first study the structure of the hyper-gradient and identify a general approximation formulation of hyper-gradient computation that encompasses several previous common approaches, e.g. back-propagation through time, conjugate gradient, \emph{etc.} Based on this formulation, we introduce a new state variable to maintain the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsPediatric Hepatobiliary Diseases and Treatments · Optimization and Variational Analysis · Advanced Optimization Algorithms Research
