General-Reasoner: Advancing LLM Reasoning Across All Domains

Xueguang Ma; Qian Liu; Dongfu Jiang; Ge Zhang; Zejun Ma; Wenhu Chen

arXiv:2505.14652·cs.CL·June 10, 2025

General-Reasoner: Advancing LLM Reasoning Across All Domains

Xueguang Ma, Qian Liu, Dongfu Jiang, Ge Zhang, Zejun Ma, Wenhu Chen

PDF

Open Access 1 Repo 4 Models 5 Datasets

TL;DR

This paper introduces General-Reasoner, a training paradigm that enhances large language models' reasoning across diverse domains by leveraging a new dataset and a generative answer verifier, leading to improved performance on multiple benchmarks.

Contribution

The paper presents a novel training framework with a large, diverse dataset and a generative verifier, enabling LLM reasoning across broad disciplines beyond math and coding.

Findings

01

Outperforms existing methods on 12 diverse benchmarks.

02

Achieves robust reasoning in physics, chemistry, finance, and electronics.

03

Maintains high effectiveness in mathematical reasoning tasks.

Abstract

Reinforcement learning (RL) has recently demonstrated strong potential in enhancing the reasoning capabilities of large language models (LLMs). Particularly, the "Zero" reinforcement learning introduced by Deepseek-R1-Zero, enables direct RL training of base LLMs without relying on an intermediate supervised fine-tuning stage. Despite these advancements, current works for LLM reasoning mainly focus on mathematical and coding domains, largely due to data abundance and the ease of answer verification. This limits the applicability and generalization of such models to broader domains, where questions often have diverse answer representations, and data is more scarce. In this paper, we propose General-Reasoner, a novel training paradigm designed to enhance LLM reasoning capabilities across diverse domains. Our key contributions include: (1) constructing a large-scale, high-quality dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tiger-ai-lab/general-reasoner
pytorch

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsFocus · Balanced Selection