Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling

Qi Wang; Hongzhi Zhang; Jia Fu; Kai Fu; Yahui Liu; Tinghai Zhang; Chenxi Sun; Gangwei Jiang; Jingyi Tang; Xingguang Ji; Yang Yue; Jingyuan Zhang; Fuzheng Zhang; Kun Gai; Guorui Zhou

arXiv:2511.05951·cs.AI·November 11, 2025

Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling

Qi Wang, Hongzhi Zhang, Jia Fu, Kai Fu, Yahui Liu, Tinghai Zhang, Chenxi Sun, Gangwei Jiang, Jingyi Tang, Xingguang Ji, Yang Yue, Jingyuan Zhang, Fuzheng Zhang, Kun Gai, Guorui Zhou

PDF

Open Access

TL;DR

This paper introduces Klear-Qwen3-AgentForge, an open-source pipeline for training high-performance agentic language models capable of tool use and coding, achieving state-of-the-art results among similar-sized models.

Contribution

It presents a comprehensive, open-source training pipeline combining supervised fine-tuning and reinforcement learning for agentic models starting from Qwen3-8B.

Findings

01

Achieves state-of-the-art performance among similar-sized models.

02

Remains competitive with larger models.

03

Effective for diverse agentic tasks.

Abstract

Despite the proliferation of powerful agentic models, the lack of critical post-training details hinders the development of strong counterparts in the open-source community. In this study, we present a comprehensive and fully open-source pipeline for training a high-performance agentic model for interacting with external tools and environments, named Klear-Qwen3-AgentForge, starting from the Qwen3-8B base model. We design effective supervised fine-tuning (SFT) with synthetic data followed by multi-turn reinforcement learning (RL) to unlock the potential for multiple diverse agentic tasks. We perform exclusive experiments on various agentic benchmarks in both tool use and coding domains. Klear-Qwen3-AgentForge-8B achieves state-of-the-art performance among LLMs of similar size and remains competitive with significantly larger models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Multimodal Machine Learning Applications · Robot Manipulation and Learning