Large Language Models Explore by Latent Distilling

Yuanhao Zeng; Ao Lu; Lufei Li; Zheng Zhang; Yexin Li; Kan Ren

arXiv:2604.24927·cs.CL·April 29, 2026

Large Language Models Explore by Latent Distilling

Yuanhao Zeng, Ao Lu, Lufei Li, Zheng Zhang, Yexin Li, Kan Ren

PDF

1 Repo

TL;DR

This paper introduces ESamp, a novel decoding method for large language models that enhances semantic diversity by leveraging a learned novelty signal, improving reasoning and creative tasks.

Contribution

The paper proposes a test-time training approach that uses a Distiller to predict deep-layer representations, enabling diversity-focused decoding in LLMs.

Findings

01

ESamp improves Pass@k efficiency in reasoning tasks.

02

It generalizes well across math, science, and code benchmarks.

03

It balances diversity and coherence in creative writing.

Abstract

Generating diverse responses is crucial for test-time scaling of large language models (LLMs), yet standard stochastic sampling mostly yields surface-level lexical variation, limiting semantic exploration. In this paper, we propose Exploratory Sampling (ESamp), a decoding approach that explicitly encourages semantic diversity during generation. ESamp is motivated by the well-known observation that neural networks tend to make lower-error predictions on inputs similar to those encountered before, and incur higher prediction error on novel ones. Building on this property, we train a lightweight Distiller at test time to predict deep-layer hidden representations of the LLM from its shallow-layer representations to model the LLM's depth-wise representation transitions. During decoding, the Distiller continuously adapts to the mappings induced by the current generation context. ESamp uses…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

LinesHogan/tLLM
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.