Unified Smoothing Approach for Best Hyperparameter Selection Problem Using a Bilevel Optimization Strategy
Jan Harold Alcantara, Chieu Thanh Nguyen, Takayuki Okuno, Akiko, Takeda, Jein-Shan Chen

TL;DR
This paper introduces a unified smoothing approach with new bilevel KKT conditions for optimal hyperparameter selection in nonsmooth regularized problems, improving theoretical tightness and broadening applicability.
Contribution
It proposes tighter bilevel KKT conditions and a unified smoothing method applicable to a wider class of regularizers for hyperparameter optimization.
Findings
New bilevel KKT conditions are tighter than previous ones.
The smoothing approach guarantees accumulation at bilevel KKT points under milder conditions.
Numerical results identify effective smoothing functions for hyperparameter tuning.
Abstract
Strongly motivated from use in various fields including machine learning, the methodology of sparse optimization has been developed intensively so far. Especially, the recent advance of algorithms for solving problems with nonsmooth regularizers is remarkable. However, those algorithms suppose that weight parameters of regularizers, called hyperparameters hereafter, are pre-fixed, and it is a crucial matter how the best hyperparameter should be selected. In this paper, we focus on the hyperparameter selection of regularizers related to the function with and apply a bilevel programming strategy, wherein we need to solve a bilevel problem, whose lower-level problem is nonsmooth, possibly nonconvex and non-Lipschitz. Recently, for solving a bilevel problem for hyperparameter selection of the pure regularizer Okuno et al. discovered new necessary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Advanced Optimization Algorithms Research · Optimization and Variational Analysis
