Bayesian Network Structure Discovery Using Large Language Models
Yinghuan Zhang, Yufei Zhang, Parisa Kordjamshidi, Zijun Cui

TL;DR
This paper presents a novel framework for Bayesian network structure discovery that centrally uses large language models to generate and refine network structures, excelling especially in low-data scenarios.
Contribution
It introduces PromptBN for data-free structure generation and ReActBN for data-aware refinement, integrating LLM reasoning with statistical evidence in a unified approach.
Findings
Outperforms prior methods in low-data regimes
Effective in out-of-distribution datasets
Achieves constant query complexity in structure generation
Abstract
Understanding probabilistic dependencies among variables is central to analyzing complex systems. Traditional structure learning methods often require extensive observational data or are limited by manual, error-prone incorporation of expert knowledge. Recent studies have explored using large language models (LLMs) for structure learning, but most treat LLMs as auxiliary tools for pre-processing or post-processing, leaving the core learning process data-driven. In this work, we introduce a unified framework for Bayesian network structure discovery that places LLMs at the center, supporting both data-free and data-aware settings. In the data-free regime, we introduce \textbf{PromptBN}, which leverages LLM reasoning over variable metadata to generate a complete directed acyclic graph (DAG) in a single call. PromptBN effectively enforces global consistency and acyclicity through dual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)
