Structural Causality-based Generalizable Concept Discovery Models
Sanchit Sinha, Guangzhi Xiong, Aidong Zhang

TL;DR
This paper introduces a novel approach combining variational autoencoders and structural causal models to discover task-specific, explainable concepts in neural networks, adaptable to various tasks and datasets.
Contribution
It proposes a disentanglement mechanism using VAE and SCM to learn task-specific concepts from generative factors, enhancing explainability and generalizability.
Findings
Successfully learned task-specific concepts on D-sprites and Shapes3D datasets.
Concepts are well explained by causal edges from generative factors.
Method is flexible to any number of concepts and downstream tasks.
Abstract
The rising need for explainable deep neural network architectures has utilized semantic concepts as explainable units. Several approaches utilizing disentangled representation learning estimate the generative factors and utilize them as concepts for explaining DNNs. However, even though the generative factors for a dataset remain fixed, concepts are not fixed entities and vary based on downstream tasks. In this paper, we propose a disentanglement mechanism utilizing a variational autoencoder (VAE) for learning mutually independent generative factors for a given dataset and subsequently learning task-specific concepts using a structural causal model (SCM). Our method assumes generative factors and concepts to form a bipartite graph, with directed causal edges from generative factors to concepts. Experiments are conducted on datasets with known generative factors: D-sprites and Shapes3D.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Bayesian Modeling and Causal Inference · Rough Sets and Fuzzy Logic
