When can dictionary learning uniquely recover sparse data from subsamples?
Christopher J. Hillar, Friedrich T. Sommer

TL;DR
This paper establishes theoretical conditions under which sparse dictionary learning can uniquely recover sparse representations from subsampled data, ensuring the correctness of the learned models.
Contribution
It provides new combinatorial matrix theory-based bounds that guarantee uniqueness of sparse codes and dictionaries in dictionary learning.
Findings
Derived bounds on sample sizes for guaranteed uniqueness
Proved that successful reconstruction implies original sparse codes and dictionary
Applicable to neuroscience and data analysis contexts
Abstract
Sparse coding or sparse dictionary learning has been widely used to recover underlying structure in many kinds of natural data. Here, we provide conditions guaranteeing when this recovery is universal; that is, when sparse codes and dictionaries are unique (up to natural symmetries). Our main tool is a useful lemma in combinatorial matrix theory that allows us to derive bounds on the sample sizes guaranteeing such uniqueness under various assumptions for how training data are generated. Whenever the conditions to one of our theorems are met, any sparsity-constrained learning algorithm that succeeds in reconstructing the data recovers the original sparse codes and dictionary. We also discuss potential applications to neuroscience and data analysis.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
