A Fully Single Loop Algorithm for Bilevel Optimization without Hessian   Inverse

Junyi Li; Bin Gu; Heng Huang

arXiv:2112.04660·cs.LG·December 13, 2021

A Fully Single Loop Algorithm for Bilevel Optimization without Hessian Inverse

Junyi Li, Bin Gu, Heng Huang

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel fully single loop bilevel optimization algorithm that avoids Hessian inverse computations, improving efficiency while maintaining convergence guarantees, and demonstrates its effectiveness on machine learning tasks.

Contribution

The paper develops the first fully single loop bilevel optimization algorithm that does not require Hessian inverse, with theoretical convergence analysis and empirical validation.

Findings

01

Converges with rate O(ε^{-2})

02

Effectively handles hyper-gradient approximation

03

Outperforms double loop methods in experiments

Abstract

In this paper, we propose a new Hessian inverse free Fully Single Loop Algorithm (FSLA) for bilevel optimization problems. Classic algorithms for bilevel optimization admit a double loop structure which is computationally expensive. Recently, several single loop algorithms have been proposed with optimizing the inner and outer variable alternatively. However, these algorithms not yet achieve fully single loop. As they overlook the loop needed to evaluate the hyper-gradient for a given inner and outer state. In order to develop a fully single loop algorithm, we first study the structure of the hyper-gradient and identify a general approximation formulation of hyper-gradient computation that encompasses several previous common approaches, e.g. back-propagation through time, conjugate gradient, \emph{etc.} Based on this formulation, we introduce a new state variable to maintain the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Fully Single Loop Algorithm for Bilevel Optimization without Hessian Inverse· underline

Taxonomy

TopicsPediatric Hepatobiliary Diseases and Treatments · Optimization and Variational Analysis · Advanced Optimization Algorithms Research