A Clustering Approach to Learn Sparsely-Used Overcomplete Dictionaries

Alekh Agarwal; Animashree Anandkumar; Praneeth Netrapalli

arXiv:1309.1952·stat.ML·July 8, 2014·27 cites

A Clustering Approach to Learn Sparsely-Used Overcomplete Dictionaries

Alekh Agarwal, Animashree Anandkumar, Praneeth Netrapalli

PDF

Open Access

TL;DR

This paper presents an efficient clustering-based algorithm for learning overcomplete dictionaries in sparse coding, enabling approximate recovery of dictionary elements and improving accuracy through a subsequent refinement step.

Contribution

Introduces a novel clustering approach for dictionary learning that efficiently achieves approximate recovery in sparse coding scenarios.

Findings

01

Algorithm successfully recovers dictionary elements with high accuracy

02

Clustering method outperforms existing techniques in certain settings

03

Refinement step with $\,\ell_1$-regression enhances solution quality

Abstract

We consider the problem of learning overcomplete dictionaries in the context of sparse coding, where each sample selects a sparse subset of dictionary elements. Our main result is a strategy to approximately recover the unknown dictionary using an efficient algorithm. Our algorithm is a clustering-style procedure, where each cluster is used to estimate a dictionary element. The resulting solution can often be further cleaned up to obtain a high accuracy estimate, and we provide one simple scenario where $ℓ_{1}$ -regularized regression can be used for such a second stage.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Rough Sets and Fuzzy Logic · Text and Document Classification Technologies