LLM-Prior: A Framework for Knowledge-Driven Prior Elicitation and Aggregation
Yongchao Huang

TL;DR
This paper introduces LLM-Prior, a framework that uses large language models to automate the creation and aggregation of prior distributions in Bayesian inference, making the process more scalable and less subjective.
Contribution
It presents a novel operator coupling LLMs with generative models for valid prior elicitation and extends it to multi-agent systems for robust prior aggregation.
Findings
Automates prior elicitation from unstructured data.
Ensures mathematical validity of generated priors.
Provides a federated algorithm for distributed prior aggregation.
Abstract
The specification of prior distributions is fundamental in Bayesian inference, yet it remains a significant bottleneck. The prior elicitation process is often a manual, subjective, and unscalable task. We propose a novel framework which leverages Large Language Models (LLMs) to automate and scale this process. We introduce \texttt{LLMPrior}, a principled operator that translates rich, unstructured contexts such as natural language descriptions, data or figures into valid, tractable probability distributions. We formalize this operator by architecturally coupling an LLM with an explicit, tractable generative model, such as a Gaussian Mixture Model (forming a LLM based Mixture Density Network), ensuring the resulting prior satisfies essential mathematical properties. We further extend this framework to multi-agent systems where Logarithmic Opinion Pooling is employed to aggregate prior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Technology and Learning · Higher Education Learning Practices · Logic, Reasoning, and Knowledge
