REL: Working out is all you need

Toby Simonds; Jey Han Lau; Chaithanya Bandi

arXiv:2412.04645·cs.AI·December 9, 2024

REL: Working out is all you need

Toby Simonds, Jey Han Lau, Chaithanya Bandi

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Reasoning Enhancement Loop (REL), a method to improve large language models' reasoning by generating high-quality problem-solving data, inspired by human-like systematic thinking.

Contribution

It presents the REL method and a specialized dataset to enhance LLM reasoning capabilities through synthetic worked solutions.

Findings

01

REL improves LLM planning and reasoning performance.

02

High-quality reasoning data enhances model capabilities.

03

Synthetic data generation is effective for reasoning tasks.

Abstract

Recent developments, particularly OpenAI's O1 model, have demonstrated the remarkable potential of Large Language Models (LLMs) for complex reasoning tasks. Through analysis of O1's outputs and provided sample Chain-of-Thought (CoT) demonstrations, we observe that it approaches problem-solving in a distinctly human-like manner, systematically brainstorming ideas, testing hypotheses, verifying results, and planning comprehensive solutions. These sophisticated reasoning capabilities remain notably absent in other state-of-the-art language models. In this paper, we hypothesize that this performance gap stems from the limited availability of high-quality reasoning process data in current training sets. We demonstrate that by constructing a specialized dataset focused on explicit problem-solving workflows ("worked solutions"), we can elicit substantially improved planning capabilities from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tamassimonds/rel
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAI-based Problem Solving and Planning · Multimodal Machine Learning Applications · Topic Modeling