Learning Sparse Visual Representations with Leaky Capped Norm   Regularizers

Jianqiao Wangni; Dahua Lin

arXiv:1711.02857·cs.LG·November 9, 2017·1 cites

Learning Sparse Visual Representations with Leaky Capped Norm Regularizers

Jianqiao Wangni, Dahua Lin

PDF

Open Access

TL;DR

This paper introduces the leaky capped norm regularizer for sparse visual representations, demonstrating its effectiveness in 3D shape recovery and neural networks with faster convergence and theoretical guarantees.

Contribution

It proposes the novel LCNR regularizer, develops a majorization-minimization algorithm, and provides the first convergence analysis for 3D recovery.

Findings

01

Outperforms $\\ell_1$ and other non-convex regularizations in 3D shape recovery

02

Achieves state-of-the-art performance and faster convergence

03

Provides the first theoretical convergence proof for 3D recovery

Abstract

Sparsity inducing regularization is an important part for learning over-complete visual representations. Despite the popularity of $ℓ_{1}$ regularization, in this paper, we investigate the usage of non-convex regularizations in this problem. Our contribution consists of three parts. First, we propose the leaky capped norm regularization (LCNR), which allows model weights below a certain threshold to be regularized more strongly as opposed to those above, therefore imposes strong sparsity and only introduces controllable estimation bias. We propose a majorization-minimization algorithm to optimize the joint objective function. Second, our study over monocular 3D shape recovery and neural networks with LCNR outperforms $ℓ_{1}$ and other non-convex regularizations, achieving state-of-the-art performance and faster convergence. Third, we prove a theoretical global convergence speed on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Sparse and Compressive Sensing Techniques · 3D Shape Modeling and Analysis

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings