LLM Processes: Numerical Predictive Distributions Conditioned on Natural   Language

James Requeima; John Bronskill; Dami Choi; Richard E. Turner; David; Duvenaud

arXiv:2405.12856·stat.ML·December 23, 2024·6 cites

LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language

James Requeima, John Bronskill, Dami Choi, Richard E. Turner, David, Duvenaud

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces LLM Processes, a method for generating probabilistic numerical predictions conditioned on natural language, enabling more nuanced, expert-informed regression models leveraging large language models.

Contribution

The paper presents a novel approach to eliciting coherent predictive distributions from LLMs conditioned on natural language, expanding their use in probabilistic regression and related tasks.

Findings

01

Effective elicitation of predictive distributions from LLMs

02

Improved regression performance with natural language guidance

03

Demonstrated applicability across forecasting, optimization, and image modeling

Abstract

Machine learning practitioners often face significant challenges in formally integrating their prior knowledge and beliefs into predictive models, limiting the potential for nuanced and context-aware analyses. Moreover, the expertise needed to integrate this prior knowledge into probabilistic modeling typically limits the application of these models to specialists. Our goal is to build a regression model that can process numerical data and make probabilistic predictions at arbitrary locations, guided by natural language text which describes a user's prior knowledge. Large Language Models (LLMs) provide a useful starting point for designing such a tool since they 1) provide an interface where users can incorporate expert insights in natural language and 2) provide an opportunity for leveraging latent problem-relevant knowledge encoded in LLMs that users may not have themselves. We start…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

requeima/llm_processes
pytorchOfficial

Videos

LLM Processes: Numerical Predictive Distributions Conditioned on Natural Language· slideslive

Taxonomy

TopicsNeural Networks and Applications