Analyzing Inexact Hypergradients for Bilevel Learning
Matthias J. Ehrhardt, Lindon Roberts

TL;DR
This paper introduces a flexible framework for computing hypergradients in bilevel learning, unifying existing methods, providing error bounds, and highlighting the importance of hypergradient algorithms in optimization.
Contribution
It presents a unified, adaptable framework for hypergradient computation that connects existing approaches and offers error bounds, improving bilevel learning efficiency.
Findings
Unified framework connects implicit function theorem and backpropagation methods.
A posteriori error bounds are more accurate than a priori bounds.
Choice of hypergradient algorithm significantly impacts bilevel optimization performance.
Abstract
Estimating hyperparameters has been a long-standing problem in machine learning. We consider the case where the task at hand is modeled as the solution to an optimization problem. Here the exact gradient with respect to the hyperparameters cannot be feasibly computed and approximate strategies are required. We introduce a unified framework for computing hypergradients that generalizes existing methods based on the implicit function theorem and automatic differentiation/backpropagation, showing that these two seemingly disparate approaches are actually tightly connected. Our framework is extremely flexible, allowing its subproblems to be solved with any suitable method, to any degree of accuracy. We derive a priori and computable a posteriori error bounds for all our methods, and numerically show that our a posteriori bounds are usually more accurate. Our numerical results also show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPancreatic and Hepatic Oncology Research · Pediatric Hepatobiliary Diseases and Treatments · Machine Learning and Algorithms
