Tongyi DeepResearch Technical Report

Tongyi DeepResearch Team: Baixuan Li; Bo Zhang; Dingchu Zhang; Fei Huang; Guangyu Li; Guoxin Chen; Huifeng Yin; Jialong Wu; Jingren Zhou; Kuan Li; Liangcai Su; Litu Ou; Liwen Zhang; Pengjun Xie; Rui Ye; Wenbiao Yin; Xinmiao Yu; Xinyu Wang; Xixi Wu; Xuanzhong Chen; Yida Zhao; Zhen Zhang; Zhengwei Tao; Zhongwang Zhang; Zile Qiao; Chenxi Wang; Donglei Yu; Gang Fu; Haiyang Shen; Jiayin Yang; Jun Lin; Junkai Zhang; Kui Zeng; Li Yang; Hailong Yin; Maojia Song; Ming Yan; Minpeng Liao; Peng Xia; Qian Xiao; Rui Min; Ruixue Ding; Runnan Fang; Shaowei Chen; Shen Huang; Shihang Wang; Shihao Cai; Weizhou Shen; Xiaobin Wang; Xin Guan; Xinyu Geng; Yingcheng Shi; Yuning Wu; Zhuo Chen; Zijian Li; Yong Jiang

arXiv:2510.24701·cs.CL·May 19, 2026

Tongyi DeepResearch Technical Report

Tongyi DeepResearch Team: Baixuan Li, Bo Zhang, Dingchu Zhang, Fei Huang, Guangyu Li, Guoxin Chen, Huifeng Yin, Jialong Wu, Jingren Zhou, Kuan Li, Liangcai Su, Litu Ou, Liwen Zhang, Pengjun Xie, Rui Ye, Wenbiao Yin, Xinmiao Yu, Xinyu Wang, Xixi Wu, Xuanzhong Chen, Yida Zhao

PDF

TL;DR

Tongyi DeepResearch is a large language model designed for complex, long-horizon research tasks, developed through an end-to-end training framework with automatic data synthesis, achieving state-of-the-art results and open-sourcing its resources.

Contribution

The paper introduces a novel agentic large language model with a scalable automatic training pipeline for deep research tasks, setting new performance benchmarks.

Findings

01

Achieves state-of-the-art performance on multiple deep research benchmarks.

02

Uses a highly scalable, fully automatic data synthesis pipeline.

03

Features a 30.5 billion parameter model with efficient activation.

Abstract

We present Tongyi DeepResearch, an agentic large language model, which is specifically designed for long-horizon, deep information-seeking research tasks. To incentivize autonomous deep research agency, Tongyi DeepResearch is developed through an end-to-end training framework that combines agentic mid-training and agentic post-training, enabling scalable reasoning and information seeking across complex tasks. We design a highly scalable data synthesis pipeline that is fully automatic, without relying on costly human annotation, and empowers all training stages. By constructing customized environments for each stage, our system enables stable and consistent interactions throughout. Tongyi DeepResearch, featuring 30.5 billion total parameters, with only 3.3 billion activated per token, achieves state-of-the-art performance across a range of agentic deep research benchmarks, including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.