ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph   Environments

Pedro Gimenes; Zeyu Cao; Jeffrey Wong; Yiren Zhao

arXiv:2502.21208·cs.AI·March 3, 2025

ARIES: Autonomous Reasoning with LLMs on Interactive Thought Graph Environments

Pedro Gimenes, Zeyu Cao, Jeffrey Wong, Yiren Zhao

PDF

1 Models

TL;DR

ARIES introduces a multi-agent framework where LLMs act as policy agents to dynamically guide thought graph transformations, significantly improving reasoning accuracy and efficiency without requiring supervised fine-tuning.

Contribution

This work pioneers using off-the-shelf LLMs as policy agents in a multi-agent architecture for reasoning, eliminating the need for pre-defined transformation schedules and fine-tuning.

Findings

01

Up to 29% higher accuracy on HumanEval

02

Reduced inference costs by 35%

03

No search requirements needed

Abstract

Recent research has shown that LLM performance on reasoning tasks can be enhanced by scaling test-time compute. One promising approach, particularly with decomposable problems, involves arranging intermediate solutions as a graph on which transformations are performed to explore the solution space. However, prior works rely on pre-determined, task-specific transformation schedules which are subject to a set of searched hyperparameters. In this work, we view thought graph transformations as actions in a Markov decision process, and implement policy agents to drive effective action policies for the underlying reasoning LLM agent. In particular, we investigate the ability for another LLM to act as a policy agent on thought graph environments and introduce ARIES, a multi-agent architecture for reasoning with LLMs. In ARIES, reasoning LLM agents solve decomposed subproblems, while policy LLM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
shiviktech/The_teacher
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.