Towards Understanding Sparse Filtering: A Theoretical Perspective
Fabio Massimo Zennaro, Ke Chen

TL;DR
This paper offers a comprehensive theoretical analysis of sparse filtering, explaining why and when it works, supported by empirical validation on artificial and real datasets, and providing insights for future algorithm development.
Contribution
It provides the first thorough theoretical understanding of sparse filtering, revealing its mechanisms and conditions for success, and validates these insights experimentally.
Findings
Sparse filtering maximizes entropy of learned representations.
It implicitly preserves mutual information through data structure constraints.
Theoretical insights explain its effectiveness on real-world problems.
Abstract
In this paper we present a theoretical analysis to understand sparse filtering, a recent and effective algorithm for unsupervised learning. The aim of this research is not to show whether or how well sparse filtering works, but to understand why and when sparse filtering does work. We provide a thorough theoretical analysis of sparse filtering and its properties, and further offer an experimental validation of the main outcomes of our theoretical analysis. We show that sparse filtering works by explicitly maximizing the entropy of the learned representation through the maximization of the proxy of sparsity, and by implicitly preserving mutual information between original and learned representations through the constraint of preserving a structure of the data, specifically the structure defined by relations of neighborhoodness under the cosine distance. Furthermore, we empirically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
