M-estimation with the Trimmed l1 Penalty
Jihun Yun, Peng Zheng, Eunho Yang, Aurelie Lozano, Aleksandr Aravkin

TL;DR
This paper introduces the first statistical analysis of high-dimensional estimators with the trimmed penalty, demonstrating its advantages in support recovery, error bounds, and applications in deep learning tasks.
Contribution
It provides the first statistical properties of the trimmed penalty, develops a new efficient algorithm, and shows its benefits in various applications including deep learning.
Findings
Trimmed outperforms vanilla in sparse linear regression.
The proposed algorithm converges faster and finds lower objective values.
Trimmed improves support recovery and error bounds in high-dimensional estimation.
Abstract
We study high-dimensional estimators with the trimmed penalty, which leaves the largest parameter entries penalty-free. While optimization techniques for this nonconvex penalty have been studied, the statistical properties have not yet been analyzed. We present the first statistical analyses for -estimation and characterize support recovery, and error of the trimmed estimates as a function of the trimming parameter . Our results show different regimes based on how compares to the true support size. Our second contribution is a new algorithm for the trimmed regularization problem, which has the same theoretical convergence rate as the difference of convex (DC) algorithms, but in practice is faster and finds lower objective values. Empirical evaluation of trimming for sparse linear regression and graphical model estimation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Statistical Methods and Inference · Distributed Sensor Networks and Detection Algorithms
