LLM-Prior: A Framework for Knowledge-Driven Prior Elicitation and Aggregation

Yongchao Huang

arXiv:2508.03766·cs.LG·August 7, 2025

LLM-Prior: A Framework for Knowledge-Driven Prior Elicitation and Aggregation

Yongchao Huang

PDF

Open Access

TL;DR

This paper introduces LLM-Prior, a framework that uses large language models to automate the creation and aggregation of prior distributions in Bayesian inference, making the process more scalable and less subjective.

Contribution

It presents a novel operator coupling LLMs with generative models for valid prior elicitation and extends it to multi-agent systems for robust prior aggregation.

Findings

01

Automates prior elicitation from unstructured data.

02

Ensures mathematical validity of generated priors.

03

Provides a federated algorithm for distributed prior aggregation.

Abstract

The specification of prior distributions is fundamental in Bayesian inference, yet it remains a significant bottleneck. The prior elicitation process is often a manual, subjective, and unscalable task. We propose a novel framework which leverages Large Language Models (LLMs) to automate and scale this process. We introduce \texttt{LLMPrior}, a principled operator that translates rich, unstructured contexts such as natural language descriptions, data or figures into valid, tractable probability distributions. We formalize this operator by architecturally coupling an LLM with an explicit, tractable generative model, such as a Gaussian Mixture Model (forming a LLM based Mixture Density Network), ensuring the resulting prior satisfies essential mathematical properties. We further extend this framework to multi-agent systems where Logarithmic Opinion Pooling is employed to aggregate prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Technology and Learning · Higher Education Learning Practices · Logic, Reasoning, and Knowledge