Learning Discrete Concepts in Latent Hierarchical Models
Lingjing Kong, Guangyi Chen, Biwei Huang, Eric P. Xing, Yuejie Chi,, Kun Zhang

TL;DR
This paper formalizes the learning of discrete, hierarchical concepts as latent causal variables in high-dimensional data, providing conditions for their identification and demonstrating implications for models like diffusion models.
Contribution
It introduces a novel hierarchical causal model for concepts in high-dimensional data and establishes conditions for their unsupervised learning, extending prior work to complex structures and continuous variables.
Findings
Theoretical conditions for identifying hierarchical causal concepts.
Empirical validation with synthetic data.
Implications for understanding latent diffusion models.
Abstract
Learning concepts from natural high-dimensional data (e.g., images) holds potential in building human-aligned and interpretable machine learning models. Despite its encouraging prospect, formalization and theoretical insights into this crucial task are still lacking. In this work, we formalize concepts as discrete latent causal variables that are related via a hierarchical causal model that encodes different abstraction levels of concepts embedded in high-dimensional data (e.g., a dog breed and its eye shapes in natural images). We formulate conditions to facilitate the identification of the proposed causal model, which reveals when learning such concepts from unsupervised data is possible. Our conditions permit complex causal hierarchical structures beyond latent trees and multi-level directed acyclic graphs in prior work and can handle high-dimensional, continuous observed variables,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsData Mining Algorithms and Applications · Text and Document Classification Technologies · Machine Learning and Data Classification
MethodsDiffusion
