DLLM Agent: See Farther, Run Faster

Huiling Zhen; Weizhe Lin; Renxi Liu; Kai Han; Yiming Li; Yuchuan Tian; Hanting Chen; Xiaoguang Li; Xiaosong Li; Chen Chen; Xianzhi Yu; Mingxuan Yuan; Youliang Yan; Peifeng Qin; Jun Wang; Yu Wang; Dacheng Tao; Yunhe Wang

arXiv:2602.07451·cs.CL·March 23, 2026

DLLM Agent: See Farther, Run Faster

Huiling Zhen, Weizhe Lin, Renxi Liu, Kai Han, Yiming Li, Yuchuan Tian, Hanting Chen, Xiaoguang Li, Xiaosong Li, Chen Chen, Xianzhi Yu, Mingxuan Yuan, Youliang Yan, Peifeng Qin, Jun Wang, Yu Wang, Dacheng Tao, Yunhe Wang

PDF

Open Access 1 Models

TL;DR

This paper compares diffusion large language models (DLLMs) with autoregressive models in agent-based decision making, showing DLLMs are faster and more efficient with fewer interactions, but require careful training for tool use and attention handling.

Contribution

It demonstrates that DLLMs can outperform AR models in agent workflows, providing insights into their planning behavior and practical deployment considerations.

Findings

01

DLLM agents are over 30% faster than AR agents at similar accuracy.

02

DLLM agents need fewer interaction rounds and tool calls for correct task completion.

03

Proper training and attention alignment are crucial for effective diffusion-based agent performance.

Abstract

Diffusion large language models (DLLMs) have emerged as an alternative to autoregressive (AR) decoding with appealing efficiency and modeling properties, yet their implications for agentic multi-step decision making remain underexplored. We ask a concrete question: when the generation paradigm is changed but the agent framework and supervision are held fixed, do diffusion backbones induce systematically different planning and tool-use behaviors, and do these differences translate into end-to-end efficiency gains? We study this in a controlled setting by instantiating DLLM and AR backbones within the same agent workflow (DeepDiver) and performing matched agent-oriented fine-tuning on the same trajectory data, yielding diffusion-backed DLLM Agents and directly comparable AR agents. Across benchmarks and case studies, we find that, at comparable accuracy, DLLM Agents are on average over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
DLLM-Agent/openPangu-7B-Diffusion-DeepDiver
model· 154 dl· ♡ 7
154 dl♡ 7

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Machine Learning in Healthcare