Analyzing Inexact Hypergradients for Bilevel Learning

Matthias J. Ehrhardt; Lindon Roberts

arXiv:2301.04764·math.OC·November 16, 2023

Analyzing Inexact Hypergradients for Bilevel Learning

Matthias J. Ehrhardt, Lindon Roberts

PDF

Open Access

TL;DR

This paper introduces a flexible framework for computing hypergradients in bilevel learning, unifying existing methods, providing error bounds, and highlighting the importance of hypergradient algorithms in optimization.

Contribution

It presents a unified, adaptable framework for hypergradient computation that connects existing approaches and offers error bounds, improving bilevel learning efficiency.

Findings

01

Unified framework connects implicit function theorem and backpropagation methods.

02

A posteriori error bounds are more accurate than a priori bounds.

03

Choice of hypergradient algorithm significantly impacts bilevel optimization performance.

Abstract

Estimating hyperparameters has been a long-standing problem in machine learning. We consider the case where the task at hand is modeled as the solution to an optimization problem. Here the exact gradient with respect to the hyperparameters cannot be feasibly computed and approximate strategies are required. We introduce a unified framework for computing hypergradients that generalizes existing methods based on the implicit function theorem and automatic differentiation/backpropagation, showing that these two seemingly disparate approaches are actually tightly connected. Our framework is extremely flexible, allowing its subproblems to be solved with any suitable method, to any degree of accuracy. We derive a priori and computable a posteriori error bounds for all our methods, and numerically show that our a posteriori bounds are usually more accurate. Our numerical results also show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPancreatic and Hepatic Oncology Research · Pediatric Hepatobiliary Diseases and Treatments · Machine Learning and Algorithms