Generative random latent features models and statistics of natural images
Philipp Fleig, Ilya Nemenman

TL;DR
This paper introduces a generative latent feature model that captures key properties of natural images, revealing that sparse coding is an appropriate data decomposition method.
Contribution
The work develops a probabilistic model incorporating dependence in latent features, linking correlation patterns to natural image structures and guiding data decomposition choices.
Findings
Model accurately fits correlation data from natural images.
Sparse mixing regime matches natural image data.
Model captures clusters, sparsity, and non-negativity in data patterns.
Abstract
Complex, multivariable systems are often analyzed by grouping their constituent units into components, sometimes referred to as latent features, which afford physical or biological interpretation. However, a priori many different types of latent features and data decompositions can be defined, and one typically uses a trial and error approach to determine a decomposition that is natural to the system and its data. It is highly desirable to develop principled understanding of which decomposition is appropriate for given a data set. In this work, we take a step in this direction and argue that sample-sample correlations in the data carry important information to this effect. For this we construct a generative random latent feature matrix model of large data based on linear mixing of latent features. Key ingredient of our model is that we allow for statistical dependence between the mixing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
