Near-Linear Time Projection onto the $\ell_{1,\infty}$ Ball; Application to Sparse Autoencoders
Guillaume Perez, Laurent Condat, Michel Barlaud

TL;DR
This paper introduces a fast, exact projection algorithm onto the $\, ext{l}_{1, ext{infinity}}$ norm ball with applications to training sparse autoencoders, significantly improving efficiency in enforcing sparsity in neural networks.
Contribution
A novel projection algorithm for the $\, ext{l}_{1, ext{infinity}}$ norm ball with near-linear time complexity and practical implementation for neural network sparsification.
Findings
Algorithm is the fastest for sparsity enforcement.
Exact projection guarantees convergence in finite time.
Application to biological data demonstrates effectiveness.
Abstract
Looking for sparsity is nowadays crucial to speed up the training of large-scale neural networks. Projections onto the and are among the most efficient techniques to sparsify and reduce the overall cost of neural networks. In this paper, we introduce a new projection algorithm for the norm ball. The worst-case time complexity of this algorithm is for a matrix in . is a term that tends to 0 when the sparsity is high, and to when the sparsity is low. Its implementation is easy and it is guaranteed to converge to the exact solution in a finite time. Moreover, we propose to incorporate the ball projection while training an autoencoder to enforce feature selection and sparsity of the weights. Sparsification appears in the encoder to primarily do feature…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Machine Learning and Algorithms · Tensor decomposition and applications
MethodsFeature Selection · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
