On the Convergence of Continuous Constrained Optimization for Structure Learning
Ignavier Ng, S\'ebastien Lachapelle, Nan Rosemary Ke, Simon, Lacoste-Julien, Kun Zhang

TL;DR
This paper investigates the convergence properties of continuous constrained optimization methods, specifically ALM and QPM, for structure learning of DAGs, revealing limitations of ALM and establishing convergence guarantees for QPM.
Contribution
It provides the first analysis of ALM's convergence in DAG structure learning, shows QPM's convergence to DAGs under mild conditions, and connects these findings with existing methods.
Findings
ALM does not guarantee convergence to DAGs in practice.
QPM converges to DAG solutions under mild conditions.
Empirical results support theoretical convergence guarantees.
Abstract
Recently, structure learning of directed acyclic graphs (DAGs) has been formulated as a continuous optimization problem by leveraging an algebraic characterization of acyclicity. The constrained problem is solved using the augmented Lagrangian method (ALM) which is often preferred to the quadratic penalty method (QPM) by virtue of its standard convergence result that does not require the penalty coefficient to go to infinity, hence avoiding ill-conditioning. However, the convergence properties of these methods for structure learning, including whether they are guaranteed to return a DAG solution, remain unclear, which might limit their practical applications. In this work, we examine the convergence of ALM and QPM for structure learning in the linear, nonlinear, and confounded cases. We show that the standard convergence result of ALM does not hold in these settings, and demonstrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Advanced Graph Neural Networks · Bayesian Modeling and Causal Inference
