Learning linear structural equation models in polynomial time and sample complexity
Asish Ghoshal, Jean Honorio

TL;DR
This paper introduces a new algorithm for learning linear structural equation models (SEMs) from observational data, achieving computational and statistical efficiency in high-dimensional settings without relying on faithfulness assumptions.
Contribution
The paper presents a novel algorithm for linear SEMs that improves identifiability conditions, reduces computational complexity, and handles high-dimensional data with arbitrary noise distributions.
Findings
Recovers DAG structure in $ ilde{O}(p(d^2 + ext{log} p))$ operations.
Achieves sample complexity of $O(d^8/ ext{epsilon}^2 imes ext{log}(p/ ext{sqrt}( ext{delta})))$ for sub-Gaussian noise.
Handles high-dimensional SEMs with arbitrary noise, improving over previous methods.
Abstract
The problem of learning structural equation models (SEMs) from data is a fundamental problem in causal inference. We develop a new algorithm --- which is computationally and statistically efficient and works in the high-dimensional regime --- for learning linear SEMs from purely observational data with arbitrary noise distribution. We consider three aspects of the problem: identifiability, computational efficiency, and statistical efficiency. We show that when data is generated from a linear SEM over nodes and maximum degree , our algorithm recovers the directed acyclic graph (DAG) structure of the SEM under an identifiability condition that is more general than those considered in the literature, and without faithfulness assumptions. In the population setting, our algorithm recovers the DAG structure in operations. In the finite sample setting, if…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Statistical Methods and Inference
