New Computational and Statistical Aspects of Regularized Regression with Application to Rare Feature Selection and Aggregation
Amin Jalali, Adel Javanmard, Maryam Fazel

TL;DR
This paper introduces a unified computational framework for structured regularization norms, focusing on rare feature selection and aggregation, with applications to high-dimensional regression and machine learning.
Contribution
It develops a new norm called the doubly-sparse norm, along with optimization tools and statistical bounds, for regularized regression involving rare features.
Findings
Proposes the doubly-sparse norm for promoting sparse and grouped features.
Develops an efficient quadratic programming approach for projections.
Provides statistical bounds for feature selection and aggregation.
Abstract
Prior knowledge on properties of a target model often come as discrete or combinatorial descriptions. This work provides a unified computational framework for defining norms that promote such structures. More specifically, we develop associated tools for optimization involving such norms given only the orthogonal projection oracle onto the non-convex set of desired models. As an example, we study a norm, which we term the doubly-sparse norm, for promoting vectors with few nonzero entries taking only a few distinct values. We further discuss how the K-means algorithm can serve as the underlying projection oracle in this case and how it can be efficiently represented as a quadratically constrained quadratic program. Our motivation for the study of this norm is regularized regression in the presence of rare features which poses a challenge to various methods within high-dimensional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Face and Expression Recognition
