Learning Adaptive LLM Decoding

Chloe H. Su; Zhe Ye; Samuel Tenka; Aidan Yang; Soonho Kong; Udaya Ghai

arXiv:2603.09065·cs.LG·March 17, 2026

Learning Adaptive LLM Decoding

Chloe H. Su, Zhe Ye, Samuel Tenka, Aidan Yang, Soonho Kong, Udaya Ghai

PDF

Open Access

TL;DR

This paper introduces a method for learning adaptive decoding policies for large language models, dynamically selecting sampling strategies during inference to improve accuracy and efficiency across tasks like math and coding.

Contribution

It proposes lightweight reinforcement learning-based decoding adapters that adapt sampling strategies at both sequence and token levels without fine-tuning the language model.

Findings

01

Token-level adapter improves Pass@1 accuracy by up to 10.2% on MATH.

02

Sequence-level adapter yields 2-3% accuracy gains under fixed sampling budgets.

03

Both sequence- and token-level adaptations contribute to improved performance.

Abstract

Decoding from large language models (LLMs) typically relies on fixed sampling hyperparameters (e.g., temperature, top-p), despite substantial variation in task difficulty and uncertainty across prompts and individual decoding steps. We propose to learn adaptive decoding policies that dynamically select sampling strategies at inference time, conditioned on available compute resources. Rather than fine-tuning the language model itself, we introduce lightweight decoding adapters trained with reinforcement learning and verifiable terminal rewards (e.g. correctness on math and coding tasks). At the sequence level, we frame decoding as a contextual bandit problem: a policy selects a decoding strategy (e.g. greedy, top-k, min-p) for each prompt, conditioned on the prompt embedding and a parallel sampling budget. At the token level, we model decoding as a partially observable Markov decision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification