DR-Venus: Towards Frontier Edge-Scale Deep Research Agents with Only 10K Open Data
Venus Team, Sunhao Dai, Yong Deng, Jinzhen Lin, Yusheng Song, Guoqing Wang, Xiaofeng Wu, Yuqi Zhou, Shuo Yang, Zhenzhe Ying, Zhanwei Zhang, Changhua Meng, Weiqiang Wang

TL;DR
DR-Venus is a small 4B parameter deep research agent trained solely on 10K open data, combining supervised fine-tuning and reinforcement learning to outperform larger models on research benchmarks.
Contribution
The paper introduces DR-Venus, a novel small-scale deep research agent built on open data, with a new training recipe combining data quality improvements and RL enhancements.
Findings
DR-Venus outperforms prior 9B models on multiple benchmarks.
It narrows the performance gap to larger 30B systems.
Small 4B agents show strong potential for edge deployment.
Abstract
Edge-scale deep research agents based on small language models are attractive for real-world deployment due to their advantages in cost, latency, and privacy. In this work, we study how to train a strong small deep research agent under limited open-data by improving both data quality and data utilization. We present DR-Venus, a frontier 4B deep research agent for edge-scale deployment, built entirely on open data. Our training recipe consists of two stages. In the first stage, we use agentic supervised fine-tuning (SFT) to establish basic agentic capability, combining strict data cleaning with resampling of long-horizon trajectories to improve data quality and utilization. In the second stage, we apply agentic reinforcement learning (RL) to further improve execution reliability on long-horizon deep research tasks. To make RL effective for small agents in this setting, we build on IGPO…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗inclusionAI/DR-Venus-4B-RLmodel· 890 dl· ♡ 13890 dl♡ 13
- 🤗inclusionAI/DR-Venus-4B-SFT-GGUFmodel· 356 dl· ♡ 3356 dl♡ 3
- 🤗inclusionAI/DR-Venus-4B-SFTmodel· 736 dl· ♡ 7736 dl♡ 7
- 🤗inclusionAI/DR-Venus-4B-RL-GGUFmodel· 1.2k dl· ♡ 101.2k dl♡ 10
- 🤗mlx-community/DR-Venus-4B-SFT-mlx-8Bitmodel· 131 dl· ♡ 2131 dl♡ 2
- 🤗mlx-community/DR-Venus-4B-RL-mlx-8Bitmodel· 105 dl· ♡ 1105 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
