OpenR: An Open Source Framework for Advanced Reasoning with Large   Language Models

Jun Wang; Meng Fang; Ziyu Wan; Muning Wen; Jiachen Zhu; Anjie Liu,; Ziqin Gong; Yan Song; Lei Chen; Lionel M. Ni; Linyi Yang; Ying Wen; Weinan; Zhang

arXiv:2410.09671·cs.AI·October 15, 2024·2 cites

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Jun Wang, Meng Fang, Ziyu Wan, Muning Wen, Jiachen Zhu, Anjie Liu,, Ziqin Gong, Yan Song, Lei Chen, Lionel M. Ni, Linyi Yang, Ying Wen, Weinan, Zhang

PDF

Open Access 1 Repo 1 Models

TL;DR

OpenR is an open-source framework that integrates data acquisition, reinforcement learning, and non-autoregressive decoding to enhance reasoning in large language models, inspired by OpenAI's o1 model, and demonstrated with improved performance on the MATH dataset.

Contribution

OpenR is the first open-source platform to implement core techniques of OpenAI's o1 model, advancing LLM reasoning through integrated reinforcement learning and process supervision.

Findings

01

Substantial reasoning performance improvements on the MATH dataset.

02

Effective use of test-time compute and reinforcement learning for reasoning.

03

OpenR code and models are publicly available for community development.

Abstract

In this technical report, we introduce OpenR, an open-source framework designed to integrate key components for enhancing the reasoning capabilities of large language models (LLMs). OpenR unifies data acquisition, reinforcement learning training (both online and offline), and non-autoregressive decoding into a cohesive software platform. Our goal is to establish an open-source platform and community to accelerate the development of LLM reasoning. Inspired by the success of OpenAI's o1 model, which demonstrated improved reasoning abilities through step-by-step reasoning and reinforcement learning, OpenR integrates test-time compute, reinforcement learning, and process supervision to improve reasoning in LLMs. Our work is the first to provide an open-source framework that explores the core techniques of OpenAI's o1 model with reinforcement learning, achieving advanced reasoning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

openreasoner/openr
pytorch

Models

🤗
Quinn777/AtomThinkPRM
model· 13 dl· ♡ 1
13 dl♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Intelligent Tutoring Systems and Adaptive Learning