LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval

Jiajie Jin; Yanzhao Zhang; Mingxin Li; Dingkun Long; Pengjun Xie; Yutao Zhu; Zhicheng Dou

arXiv:2603.01425·cs.CL·March 3, 2026

LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval

Jiajie Jin, Yanzhao Zhang, Mingxin Li, Dingkun Long, Pengjun Xie, Yutao Zhu, Zhicheng Dou

PDF

Open Access 3 Models

TL;DR

LaSER introduces a self-distillation framework that internalizes explicit reasoning into dense retrievers' latent space, enabling reasoning capabilities without the latency of explicit CoT generation, and significantly improves performance on reasoning benchmarks.

Contribution

The paper presents LaSER, a novel training method that internalizes explicit reasoning into the latent space of dense retrievers, bridging explicit reasoning paths with implicit latent thinking.

Findings

01

Outperforms state-of-the-art baselines on reasoning benchmarks.

02

Demonstrates robustness across different backbones and model scales.

03

Effectively combines reasoning depth with inference efficiency.

Abstract

LLMs have fundamentally transformed dense retrieval, upgrading backbones from discriminative encoders to generative architectures. However, a critical disconnect remains: while LLMs possess strong reasoning capabilities, current retrievers predominantly utilize them as static encoders, leaving their potential for complex reasoning unexplored. To address this, existing approaches typically adopt rewrite-then-retrieve pipelines to generate explicit CoT rationales before retrieval. However, this incurs prohibitive latency. In this paper, we propose LaSER, a novel self-distillation framework that internalizes explicit reasoning into the latent space of dense retrievers. Operating on a shared LLM backbone, LaSER introduces a dual-view training mechanism: an Explicit view that explicitly encodes ground-truth reasoning paths, and a Latent view that performs implicit latent thinking. To bridge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Information Retrieval and Search Behavior