Learning Sparse Visual Representations with Leaky Capped Norm Regularizers
Jianqiao Wangni, Dahua Lin

TL;DR
This paper introduces the leaky capped norm regularizer for sparse visual representations, demonstrating its effectiveness in 3D shape recovery and neural networks with faster convergence and theoretical guarantees.
Contribution
It proposes the novel LCNR regularizer, develops a majorization-minimization algorithm, and provides the first convergence analysis for 3D recovery.
Findings
Outperforms $\\ell_1$ and other non-convex regularizations in 3D shape recovery
Achieves state-of-the-art performance and faster convergence
Provides the first theoretical convergence proof for 3D recovery
Abstract
Sparsity inducing regularization is an important part for learning over-complete visual representations. Despite the popularity of regularization, in this paper, we investigate the usage of non-convex regularizations in this problem. Our contribution consists of three parts. First, we propose the leaky capped norm regularization (LCNR), which allows model weights below a certain threshold to be regularized more strongly as opposed to those above, therefore imposes strong sparsity and only introduces controllable estimation bias. We propose a majorization-minimization algorithm to optimize the joint objective function. Second, our study over monocular 3D shape recovery and neural networks with LCNR outperforms and other non-convex regularizations, achieving state-of-the-art performance and faster convergence. Third, we prove a theoretical global convergence speed on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Sparse and Compressive Sensing Techniques · 3D Shape Modeling and Analysis
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
