What Language Models Know But Don't Say: Non-Generative Prior Extraction for Generalization
Sara Rezaeimanesh, Mohammad M. Ghassemi

TL;DR
This paper introduces LoID, a deterministic method to extract informative priors from large language models for Bayesian logistic regression, significantly improving out-of-distribution generalization on real-world datasets.
Contribution
LoID provides a novel, efficient approach to leverage LLMs for prior extraction, outperforming existing methods in OOD settings and enhancing Bayesian model performance.
Findings
LoID improves AUC performance by up to 59% over standard logistic regression.
LoID outperforms AutoElicit and LLMProcess on 8 out of 10 datasets.
LoID offers a reproducible, computationally efficient way to incorporate LLM knowledge into Bayesian inference.
Abstract
In domains like medicine and finance, large-scale labeled data is costly and often unavailable, leading to models trained on small datasets that struggle to generalize to real-world populations. Large language models contain extensive knowledge from years of research across these domains. We propose LoID (Logit-Informed Distributions), a deterministic method for extracting informative prior distributions for Bayesian logistic regression by directly accessing their token-level predictions. Rather than relying on generated text, we probe the model's confidence in opposing semantic directions (positive vs. negative impact) through carefully constructed sentences. By measuring how consistently the LLM favors one direction across diverse phrasings, we extract the strength and reliability of the model's belief about each feature's influence. We evaluate LoID on ten real-world tabular datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
