Diverse Dictionary Learning

Yujia Zheng; Zijian Li; Shunxing Fan; Andrew Gordon Wilson; Kun Zhang

arXiv:2604.17568·cs.LG·April 21, 2026

Diverse Dictionary Learning

Yujia Zheng, Zijian Li, Shunxing Fan, Andrew Gordon Wilson, Kun Zhang

PDF

1 Video

TL;DR

This paper introduces diverse dictionary learning, showing that certain set-theoretic structures of latent variables are identifiable under minimal assumptions, enabling more reliable understanding of hidden data.

Contribution

It formalizes the concept of diverse dictionary learning, providing set-theoretic identifiability results and a simple inductive bias applicable to various models.

Findings

01

Set operations on latent variables are identifiable with minimal assumptions.

02

Structural diversity enables full identifiability of all latent variables.

03

The proposed bias improves latent variable recovery on synthetic and real data.

Abstract

Given only observational data $X = g (Z)$ , where both the latent variables $Z$ and the generating process $g$ are unknown, recovering $Z$ is ill-posed without additional assumptions. Existing methods often assume linearity or rely on auxiliary supervision and functional constraints. However, such assumptions are rarely verifiable in practice, and most theoretical guarantees break down under even mild violations, leaving uncertainty about how to reliably understand the hidden world. To make identifiability actionable in the real-world scenarios, we take a complementary view: in the general settings where full identifiability is unattainable, what can still be recovered with guarantees, and what biases could be universally adopted? We introduce the problem of diverse dictionary learning to formalize this view. Specifically, we show that intersections, complements, and symmetric differences…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Diverse Dictionary Learning· slideslive