Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward

Yong Deng; Guoqing Wang; Zhenzhe Ying; Xiaofeng Wu; Jinzhen Lin; Wenwen Xiong; Yuqin Dai; Shuo Yang; Zhanwei Zhang; Qiwen Wang; Yang Qin; Yuan Wang; Quanxing Zha; Sunhao Dai; Changhua Meng

arXiv:2508.12800·cs.CL·September 1, 2025

Atom-Searcher: Enhancing Agentic Deep Research via Fine-Grained Atomic Thought Reward

Yong Deng, Guoqing Wang, Zhenzhe Ying, Xiaofeng Wu, Jinzhen Lin, Wenwen Xiong, Yuqin Dai, Shuo Yang, Zhanwei Zhang, Qiwen Wang, Yang Qin, Yuan Wang, Quanxing Zha, Sunhao Dai, Changhua Meng

PDF

1 Models

TL;DR

Atom-Searcher introduces a fine-grained reasoning framework with Atomic Thought and reward models, significantly improving multi-hop reasoning and strategic search in LLMs through a curriculum-based RL approach.

Contribution

The paper proposes Atomic Thought and ATR to enhance RL-based agentic research, enabling better reasoning, interpretability, and scalability in LLMs.

Findings

01

Consistent performance improvements on seven benchmarks.

02

Enhanced interpretability and human-like reasoning patterns.

03

Scalable computation at test-time.

Abstract

Large language models (LLMs) exhibit remarkable problem-solving abilities, but struggle with complex tasks due to static internal knowledge. Retrieval-Augmented Generation (RAG) enhances access to external information, yet remains limited in multi-hop reasoning and strategic search due to rigid workflows. Recent advancements in agentic deep research empower LLMs to autonomously reason, search, and synthesize information. However, current approaches relying on outcome-based reinforcement learning (RL) face critical issues such as conflicting gradients and reward sparsity, limiting performance gains and training efficiency. To address these, we first propose Atomic Thought, a novel LLM thinking paradigm that decomposes reasoning into fine-grained functional units. These units are supervised by Reasoning Reward Models (RRMs), which provide Atomic Thought Rewards (ATR) for fine-grained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
dikw/Atom-Searcher
model· 2 dl· ♡ 2
2 dl♡ 2

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.