Bilevel Optimization with a Lower-level Contraction: Optimal Sample Complexity without Warm-start
Riccardo Grazzi, Massimiliano Pontil, Saverio Salzo

TL;DR
This paper demonstrates that in bilevel problems with a contraction lower-level map, near-optimal sample complexity can be achieved without warm-start techniques, simplifying analysis and broadening applicability.
Contribution
It introduces a simple method combining fixed point iterations and projected inexact gradient descent that attains optimal sample complexity without warm-start, applicable to various bilevel problems.
Findings
Achieves $O( ext{epsilon}^{-2})$ sample complexity in stochastic setting.
Achieves $ ilde{O}( ext{epsilon}^{-1})$ sample complexity in deterministic setting.
Simplifies analysis by avoiding coupled interactions between levels.
Abstract
We analyse a general class of bilevel problems, in which the upper-level problem consists in the minimization of a smooth objective function and the lower-level problem is to find the fixed point of a smooth contraction map. This type of problems include instances of meta-learning, equilibrium models, hyperparameter optimization and data poisoning adversarial attacks. Several recent works have proposed algorithms which warm-start the lower-level problem, i.e.~they use the previous lower-level approximate solution as a staring point for the lower-level solver. This warm-start procedure allows one to improve the sample complexity in both the stochastic and deterministic settings, achieving in some cases the order-wise optimal sample complexity. However, there are situations, e.g., meta learning and equilibrium models, in which the warm-start procedure is not well-suited or ineffective. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Groundwater flow and contamination studies
