Proximal Iteration for Nonlinear Adaptive Lasso
Nathan Wycoff, Lisa O. Singh, Ali Arab, Katharine M. Donato

TL;DR
This paper introduces a joint optimization method using proximal gradient techniques to learn penalty coefficients in adaptive Lasso, reducing bias and enabling complex sparsity structures in nonlinear models.
Contribution
It develops a novel proximal gradient approach that treats penalty coefficients as decision variables, enhancing adaptive Lasso's flexibility and effectiveness in complex models.
Findings
The method is competitive in speed and accuracy on synthetic and real datasets.
It effectively reduces bias in parameter estimates.
Demonstrates applicability to nonlinear models in real-world case studies.
Abstract
Augmenting a smooth cost function with an penalty allows analysts to efficiently conduct estimation and variable selection simultaneously in sophisticated models and can be efficiently implemented using proximal gradient methods. However, one drawback of the penalty is bias: nonzero parameters are underestimated in magnitude, motivating techniques such as the Adaptive Lasso which endow each parameter with its own penalty coefficient. But it's not clear how these parameter-specific penalties should be set in complex models. In this article, we study the approach of treating the penalty coefficients as additional decision variables to be learned in a \textit{Maximum a Posteriori} manner, developing a proximal gradient approach to joint optimization of these together with the parameters of any differentiable cost function. Beyond reducing bias in estimates, this procedure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Advanced Control Systems Optimization
MethodsSparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
