Learning Identifiable Structures Helps Avoid Bias in DNN-based   Supervised Causal Learning

Jiaru Zhang; Rui Ding; Qiang Fu; Bojun Huang; Zizhen Deng; Yang Hua,; Haibing Guan; Shi Han; Dongmei Zhang

arXiv:2502.10883·cs.LG·February 18, 2025

Learning Identifiable Structures Helps Avoid Bias in DNN-based Supervised Causal Learning

Jiaru Zhang, Rui Ding, Qiang Fu, Bojun Huang, Zizhen Deng, Yang Hua,, Haibing Guan, Shi Han, Dongmei Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces SiCL, a novel DNN-based supervised causal learning method that predicts causal structures without bias, outperforming existing approaches by leveraging identifiable structures and a specialized pairwise encoder.

Contribution

The paper proposes SiCL, a new architecture that predicts skeletons and v-structures directly, avoiding bias in traditional node-edge models and enabling consistent causal discovery.

Findings

01

SiCL outperforms existing DNN-based methods on synthetic benchmarks.

02

It effectively predicts causal skeletons and v-structures, aligning with MEC theory.

03

Experimental results demonstrate improved accuracy and consistency.

Abstract

Causal discovery is a structured prediction task that aims to predict causal relations among variables based on their data samples. Supervised Causal Learning (SCL) is an emerging paradigm in this field. Existing Deep Neural Network (DNN)-based methods commonly adopt the "Node-Edge approach", in which the model first computes an embedding vector for each variable-node, then uses these variable-wise representations to concurrently and independently predict for each directed causal-edge. In this paper, we first show that this architecture has some systematic bias that cannot be mitigated regardless of model size and data size. We then propose SiCL, a DNN-based SCL method that predicts a skeleton matrix together with a v-tensor (a third-order tensor representing the v-structures). According to the Markov Equivalence Class (MEC) theory, both the skeleton and the v-structures are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/reliableai
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Fault Detection and Control Systems

MethodsSoftmax · Attention Is All You Need · ADaptive gradient method with the OPTimal convergence rate