Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization

Aliakbar Nafar; Kristen Brent Venable; Zijun Cui; Parisa Kordjamshidi

arXiv:2505.15918·cs.CL·August 12, 2025

Extracting Probabilistic Knowledge from Large Language Models for Bayesian Network Parameterization

Aliakbar Nafar, Kristen Brent Venable, Zijun Cui, Parisa Kordjamshidi

PDF

Open Access

TL;DR

This paper investigates using Large Language Models to extract probabilistic knowledge for Bayesian Network parameterization, demonstrating their potential to generate meaningful priors across various domains and improve modeling with limited data.

Contribution

It introduces a novel method for leveraging LLMs to derive probabilistic priors for Bayesian Networks and establishes the first comprehensive baseline for evaluating LLMs in this context.

Findings

01

LLMs can produce meaningful probabilistic estimates for BN parameters.

02

Querying LLMs outperforms random and uniform baselines.

03

LLM-derived priors enhance Bayesian modeling, especially with scarce data.

Abstract

In this work, we evaluate the potential of Large Language Models (LLMs) in building Bayesian Networks (BNs) by approximating domain expert priors. LLMs have demonstrated potential as factual knowledge bases; however, their capability to generate probabilistic knowledge about real-world events remains understudied. We explore utilizing the probabilistic knowledge inherent in LLMs to derive probability estimates for statements regarding events and their relationships within a BN. Using LLMs in this context allows for the parameterization of BNs, enabling probabilistic modeling within specific domains. Our experiments on eighty publicly available Bayesian Networks, from healthcare to finance, demonstrate that querying LLMs about the conditional probabilities of events provides meaningful results when compared to baselines, including random and uniform distributions, as well as approaches…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBayesian Modeling and Causal Inference · Topic Modeling · Machine Learning in Healthcare