Convex Mixed-Integer Programming for Causal Additive Models with Optimization and Statistical Guarantees
Xiaozhu Zhang, Nir Keret, Ali Shojaie, Armeen Taeb

TL;DR
This paper introduces a convex mixed-integer programming approach for learning causal graphs from non-linear additive models, providing statistical consistency and optimization guarantees, and demonstrating superior performance over existing methods.
Contribution
It develops a novel convex mixed-integer program for causal graph learning with statistical guarantees and practical optimization strategies, extending beyond linear or heuristic methods.
Findings
Achieves consistent graph recovery with growing variables.
Provides an early stopping criterion with guarantees.
Outperforms existing methods in simulations and real data.
Abstract
We study the problem of learning a directed acyclic graph from data generated according to an additive, non-linear structural equation model with Gaussian noise. We express each non-linear function through a basis expansion, and derive a maximum likelihood estimator with a group l0-regularization that penalizes the number of edges in the graph. The resulting estimator is formulated through a convex mixed-integer program, enabling the use of branch-and-bound methods to obtain a solution that is guaranteed to be accurate up to a pre-specified optimality gap. Our formulation can naturally encode background knowledge, such as the presence or absence of edges and partial order constraints among the variables. We establish consistency guarantees for our estimator in terms of graph recovery, even when the number of variables grows with the sample size. Additionally, by connecting the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Advanced Graph Neural Networks · Statistical Methods and Inference
