DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

Venus Team; Sunhao Dai; Yong Deng; Jinzhen Lin; Yusheng Song; Guoqing Wang; Xiaofeng Wu; Yuqi Zhou; Shuo Yang; Zhenzhe Ying; Zhanwei Zhang; Changhua Meng; Weiqiang Wang

arXiv:2604.19859·cs.LG·April 23, 2026

DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data

Venus Team, Sunhao Dai, Yong Deng, Jinzhen Lin, Yusheng Song, Guoqing Wang, Xiaofeng Wu, Yuqi Zhou, Shuo Yang, Zhenzhe Ying, Zhanwei Zhang, Changhua Meng, Weiqiang Wang

PDF

1 Repo 6 Models

TL;DR

DR-Venus is a small 4B parameter deep research agent trained solely on 10K open data, combining supervised fine-tuning and reinforcement learning to outperform larger models on research benchmarks.

Contribution

The paper introduces DR-Venus, a novel small-scale deep research agent built on open data, with a new training recipe combining data quality improvements and RL enhancements.

Findings

01

DR-Venus outperforms prior 9B models on multiple benchmarks.

02

It narrows the performance gap to larger 30B systems.

03

Small 4B agents show strong potential for edge deployment.

Abstract

Edge-scale deep research agents based on small language models are attractive for real-world deployment due to their advantages in cost, latency, and privacy. In this work, we study how to train a strong small deep research agent under limited open-data by improving both data quality and data utilization. We present DR-Venus, a frontier 4B deep research agent for edge-scale deployment, built entirely on open data. Our training recipe consists of two stages. In the first stage, we use agentic supervised fine-tuning (SFT) to establish basic agentic capability, combining strict data cleaning with resampling of long-horizon trajectories to improve data quality and utilization. In the second stage, we apply agentic reinforcement learning (RL) to further improve execution reliability on long-horizon deep research tasks. To make RL effective for small agents in this setting, we build on IGPO…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

inclusionai/DR-Venus
github

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.