Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Xuchen Pan; Yanxi Chen; Yushuo Chen; Yuchang Sun; Daoyuan Chen; Wenhao Zhang; Yuexiang Xie; Yilun Huang; Yilei Zhang; Dawei Gao; Weijie Shi; Yaliang Li; Bolin Ding; Jingren Zhou

arXiv:2505.17826·cs.LG·September 30, 2025

Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

Xuchen Pan, Yanxi Chen, Yushuo Chen, Yuchang Sun, Daoyuan Chen, Wenhao Zhang, Yuexiang Xie, Yilun Huang, Yilei Zhang, Dawei Gao, Weijie Shi, Yaliang Li, Bolin Ding, Jingren Zhou

PDF

Open Access 1 Repo

TL;DR

Trinity-RFT is a versatile, modular framework that simplifies reinforcement fine-tuning of large language models, supporting various modes and applications with high efficiency.

Contribution

It introduces a unified, flexible platform for reinforcement fine-tuning of large language models, integrating multiple RFT modes and optimizing data pipelines.

Findings

01

Demonstrates high efficiency and robustness in diverse RFT scenarios.

02

Supports both on-policy and off-policy reinforcement learning.

03

Facilitates research and development of advanced RL paradigms.

Abstract

Trinity-RFT is a general-purpose, unified and easy-to-use framework designed for reinforcement fine-tuning (RFT) of large language models. It is built with a modular and decoupled design, consisting of (1) an RFT-core that unifies and generalizes synchronous/asynchronous, on-policy/off-policy, and online/offline modes of RFT; (2) seamless integration for agent-environment interaction with high efficiency and robustness; and (3) systematic data pipelines optimized for RFT. Trinity-RFT can be easily adapted for diverse application scenarios, and serves as a unified platform for development and research of advanced reinforcement learning paradigms at both macroscopic and microscopic levels. This technical report outlines the vision, features, design and implementations of Trinity-RFT, accompanied by extensive examples, applications and experiments that demonstrate its functionalities and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

modelscope/trinity-rft
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems