$\psi$DAG: Projected Stochastic Approximation Iteration for DAG Structure Learning
Klea Ziu, Slavom\'ir Hanzely, Loka Li, Kun Zhang, Martin Tak\'a\v{c},, Dmitry Kamzolov

TL;DR
This paper introduces $$DAG, a stochastic approximation method with projection techniques for efficient and scalable DAG structure learning, addressing optimization challenges in large-scale problems.
Contribution
It proposes a novel stochastic approximation framework with specialized projection methods for DAG learning, improving efficiency and scalability over existing approaches.
Findings
Outperforms existing methods in large-scale DAG learning tasks.
Demonstrates improved convergence and computational efficiency.
Achieves superior accuracy across various experimental settings.
Abstract
Learning the structure of Directed Acyclic Graphs (DAGs) presents a significant challenge due to the vast combinatorial search space of possible graphs, which scales exponentially with the number of nodes. Recent advancements have redefined this problem as a continuous optimization task by incorporating differentiable acyclicity constraints. These methods commonly rely on algebraic characterizations of DAGs, such as matrix exponentials, to enable the use of gradient-based optimization techniques. Despite these innovations, existing methods often face optimization difficulties due to the highly non-convex nature of DAG constraints and the per-iteration computational complexity. In this work, we present a novel framework for learning DAGs, employing a Stochastic Approximation approach integrated with Stochastic Gradient Descent (SGD)-based optimization techniques. Our framework introduces…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
