Convergence Analysis for Deep Sparse Coding via Convolutional Neural Networks
Jianfei Li, Han Feng, Ding-Xuan Zhou

TL;DR
This paper provides a theoretical convergence analysis of deep sparse coding models and convolutional neural networks, extending to various architectures, and offers training strategies to promote sparse feature learning.
Contribution
It introduces a new class of Deep Sparse Coding models, analyzes their convergence properties, and extends the analysis to diverse neural network architectures, including transformers.
Findings
Convergence rates for CNNs in sparse feature extraction are established.
Training strategies effectively promote sparser features in neural networks.
Theoretical foundations support CNNs' use in sparse feature learning tasks.
Abstract
In this work, we explore the intersection of sparse coding theory and deep learning to enhance our understanding of feature extraction capabilities in advanced neural network architectures. We begin by introducing a novel class of Deep Sparse Coding (DSC) models and establish a thorough theoretical analysis of their uniqueness and stability properties. By applying iterative algorithms to these DSC models, we derive convergence rates for convolutional neural networks (CNNs) in their ability to extract sparse features. This provides a strong theoretical foundation for the use of CNNs in sparse feature-learning tasks. We additionally extend this convergence analysis to more general neural network architectures, including those with diverse activation functions, as well as self-attention and transformer-based models. This broadens the applicability of our findings to a wide range of deep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques
