Personalized Dictionary Learning for Heterogeneous Datasets

Geyu Liang; Naichen Shi; Raed Al Kontar; Salar Fattahi

arXiv:2305.15311·cs.LG·May 25, 2023·1 cites

Personalized Dictionary Learning for Heterogeneous Datasets

Geyu Liang, Naichen Shi, Raed Al Kontar, Salar Fattahi

PDF

Open Access 1 Video

TL;DR

This paper proposes a novel framework called Personalized Dictionary Learning (PerDL) for extracting shared and unique features from heterogeneous datasets, with a new algorithm PerMA that guarantees efficient recovery of dictionaries.

Contribution

It formulates the PerDL problem, provides conditions for disentangling shared and unique features, and introduces PerMA, a provably convergent algorithm for this task.

Findings

01

PerMA converges linearly to the true dictionaries under certain conditions.

02

The framework effectively handles imbalanced datasets and video surveillance tasks.

03

PerDL improves feature extraction from heterogeneous data sources.

Abstract

We introduce a relevant yet challenging problem named Personalized Dictionary Learning (PerDL), where the goal is to learn sparse linear representations from heterogeneous datasets that share some commonality. In PerDL, we model each dataset's shared and unique features as global and local dictionaries. Challenges for PerDL not only are inherited from classical dictionary learning (DL), but also arise due to the unknown nature of the shared and unique features. In this paper, we rigorously formulate this problem and provide conditions under which the global and local dictionaries can be provably disentangled. Under these conditions, we provide a meta-algorithm called Personalized Matching and Averaging (PerMA) that can recover both global and local dictionaries from heterogeneous datasets. PerMA is highly efficient; it converges to the ground truth at a linear rate under suitable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Personalized Dictionary Learning for Heterogeneous Datasets· slideslive

Taxonomy

TopicsText and Document Classification Technologies · Multimodal Machine Learning Applications · Speech Recognition and Synthesis