When can dictionary learning uniquely recover sparse data from subsamples?

Christopher J. Hillar; Friedrich T. Sommer

arXiv:1106.3616·q-bio.NC·November 18, 2016·IEEE Trans. Inf. Theory


TL;DR

This paper establishes theoretical conditions under which sparse dictionary learning on subsampled data uniquely recovers the sparse codes and dictionary that generated the data, up to natural symmetries.

Contribution

It uses a lemma from combinatorial matrix theory to derive sample-size bounds that guarantee uniqueness of sparse codes and dictionaries in dictionary learning.

Findings

01

Derived bounds on sample sizes for guaranteed uniqueness

02

Proved that, when the conditions of a theorem are met, any sparsity-constrained learning algorithm that reconstructs the data recovers the original sparse codes and dictionary

03

Discussed potential applications to neuroscience and data analysis

Abstract

Sparse coding or sparse dictionary learning has been widely used to recover underlying structure in many kinds of natural data. Here, we provide conditions guaranteeing when this recovery is universal; that is, when sparse codes and dictionaries are unique (up to natural symmetries). Our main tool is a useful lemma in combinatorial matrix theory that allows us to derive bounds on the sample sizes guaranteeing such uniqueness under various assumptions for how training data are generated. Whenever the conditions to one of our theorems are met, any sparsity-constrained learning algorithm that succeeds in reconstructing the data recovers the original sparse codes and dictionary. We also discuss potential applications to neuroscience and data analysis.
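
The phrase "unique (up to natural symmetries)" can be made concrete with a small numerical sketch. It assumes, as is standard in dictionary learning, that the symmetries in question are permutation and nonzero rescaling of dictionary columns; the abstract does not spell them out. All sizes and variable names below are hypothetical and chosen only for illustration, and the code checks the ambiguity itself, not the paper's sample-size bounds.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: data dimension, dictionary size, sparsity, sample count.
n, m, k, num_samples = 8, 12, 3, 50

# Hypothetical ground-truth dictionary A (n x m) and k-sparse codes X (m x N).
A = rng.standard_normal((n, m))
X = np.zeros((m, num_samples))
for j in range(num_samples):
    support = rng.choice(m, size=k, replace=False)
    X[support, j] = rng.standard_normal(k)

Y = A @ X  # observed data

# Assumed "natural symmetry": permute the dictionary columns and rescale them.
perm = rng.permutation(m)
P = np.eye(m)[:, perm]            # permutation matrix
scales = rng.uniform(0.5, 2.0, m)  # nonzero column scalings
D = np.diag(scales)
D_inv = np.diag(1.0 / scales)

B = A @ P @ D          # alternative dictionary
Z = D_inv @ P.T @ X    # alternative codes, since B @ Z = A @ P @ P.T @ X = A @ X

# Both pairs explain the data equally well, with the same sparsity per sample.
reconstruction_matches = np.allclose(B @ Z, Y)
sparsity_preserved = np.array_equal(
    np.count_nonzero(Z, axis=0), np.count_nonzero(X, axis=0)
)
```

Because `(B, Z)` reconstructs `Y` exactly as well as `(A, X)` and is just as sparse, no learning algorithm can distinguish the two from data alone. A uniqueness guarantee can therefore only assert recovery up to such transformations.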
