A Model for Combinatorial Dictionary Learning and Inference
Avrim Blum, Kavya Ravichandran

TL;DR
This paper introduces a combinatorial model for dictionary learning inspired by object occlusion, establishing conditions for learning components and addressing inference problems with robustness considerations.
Contribution
It proposes the concept of well-structuredness for component sets, providing theoretical guarantees for learning and inference in a combinatorial dictionary model.
Findings
Well-structuredness ensures learnability of component sets.
Algorithms for explaining instances with minimal components or maximal locations.
Robust inference method resilient to adversarial corruptions.
Abstract
We are often interested in decomposing complex, structured data into simple components that explain the data. The linear version of this problem is well-studied as dictionary learning and factor analysis. In this work, we propose a combinatorial model in which to study this question, motivated by the way objects occlude each other in a scene to form an image. First, we identify a property we call "well-structuredness" of a set of low-dimensional components which ensures that no two components in the set are too similar. We show how well-structuredness is sufficient for learning the set of latent components comprising a set of sample instances. We then consider the problem: given a set of components and an instance generated from some unknown subset of them, identify which parts of the instance arise from which components. We consider two variants: (1) determine the minimal number of…
Peer Reviews
Decision·ALT 2025
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · linguistics and terminology studies · Lexicography and Language Studies
MethodsSparse Evolutionary Training
