Towards Understanding Sparse Filtering: A Theoretical Perspective

Fabio Massimo Zennaro; Ke Chen

arXiv:1603.08831·cs.LG·May 25, 2021

Towards Understanding Sparse Filtering: A Theoretical Perspective

Fabio Massimo Zennaro, Ke Chen

PDF

TL;DR

This paper offers a comprehensive theoretical analysis of sparse filtering, explaining why and when it works, supported by empirical validation on artificial and real datasets, and providing insights for future algorithm development.

Contribution

It provides the first thorough theoretical understanding of sparse filtering, revealing its mechanisms and conditions for success, and validates these insights experimentally.

Findings

01

Sparse filtering maximizes entropy of learned representations.

02

It implicitly preserves mutual information through data structure constraints.

03

Theoretical insights explain its effectiveness on real-world problems.

Abstract

In this paper we present a theoretical analysis to understand sparse filtering, a recent and effective algorithm for unsupervised learning. The aim of this research is not to show whether or how well sparse filtering works, but to understand why and when sparse filtering does work. We provide a thorough theoretical analysis of sparse filtering and its properties, and further offer an experimental validation of the main outcomes of our theoretical analysis. We show that sparse filtering works by explicitly maximizing the entropy of the learned representation through the maximization of the proxy of sparsity, and by implicitly preserving mutual information between original and learned representations through the constraint of preserving a structure of the data, specifically the structure defined by relations of neighborhoodness under the cosine distance. Furthermore, we empirically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.