In-Context Function Learning in Large Language Models
Elif Akata, Konstantinos Voudouris, Vincent Fortuin, Eric Schulz

TL;DR
This paper investigates how large language models learn in-context functions by comparing their performance to Gaussian Process models, analyzing their biases, and exploring methods to improve their sample efficiency through fine-tuning.
Contribution
It introduces a GP-based framework to quantify LLM in-context learning, analyzes their inductive biases, and demonstrates how fine-tuning can shift these biases for better function learning.
Findings
LLMs' in-context learning curves approach GP lower bounds with more demonstrations.
Predictions are biased towards less smooth GP kernels.
Fine-tuning can shift inductive biases towards smoother functions.
Abstract
Large language models (LLMs) can learn from a few demonstrations provided at inference time. We study this in-context learning phenomenon through the lens of Gaussian Processes (GPs). We build controlled experiments where models observe sequences of multivariate scalar-valued function samples drawn from known GP priors. We evaluate prediction error in relation to the number of demonstrations and compare against two principled references: (i) an empirical GP-regression learner that gives a lower bound on achievable error, and (ii) the expected error of a 1-nearest-neighbor (1-NN) rule, which gives a data-driven upper bound. Across model sizes, we find that LLM learning curves are strongly influenced by the function-generating kernels and approach the GP lower bound as the number of demonstrations increases. We then study the inductive biases of these models using a likelihood-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education
