DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation

Shuzhang Zhong; Baotong Lu; Qi Chen; Chuanjie Liu; Fan Yang; Meng Li

arXiv:2603.07416·cs.LG·March 10, 2026

DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation

Shuzhang Zhong, Baotong Lu, Qi Chen, Chuanjie Liu, Fan Yang, Meng Li

PDF

Open Access

TL;DR

DualSpec introduces a dual-process speculation framework for deep research agents, leveraging action heterogeneity to significantly reduce latency while maintaining accuracy in long-horizon tasks.

Contribution

It proposes a novel heterogeneous speculation approach that differentiates between Search and Visit actions, improving speed and robustness in deep research agents.

Findings

01

Achieves up to 3.28× speedup in end-to-end inference

02

Maintains accuracy comparable to fully reasoning agents

03

Validates effectiveness across multiple models and benchmarks

Abstract

Large language model-based deep research agents have been increasingly popular for addressing long-horizon information-seeking tasks, but they often incur high end-to-end latency due to extensive reasoning and frequent tool use. Speculation frameworks aim to reduce latency by overlapping action execution with reasoning; however, existing approaches typically rely on uniform speculation strategies and strict action matching, which limits inference speedups and robustness. In this work, we revisit the speculate-verify paradigm for deep research agents through the lens of action heterogeneity. We show that \textit{Search} and \textit{Visit} actions exhibit fundamentally different reasoning and model capacity requirements: entropy-based analysis reveals that Search decisions have higher uncertainty and benefit significantly from explicit reasoning, whereas Visit decisions have lower entropy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Artificial Intelligence in Healthcare and Education · Topic Modeling