A Study on Leveraging Search and Self-Feedback for Agent Reasoning

Karthikeyan K; Michelle Yuan; Elman Mansimov; Katerina Margatina,; Anurag Pratik; Daniele Bonadiman; Monica Sunkara; Yi Zhang; Yassine Benajiba

arXiv:2502.12094·cs.AI·February 18, 2025

A Study on Leveraging Search and Self-Feedback for Agent Reasoning

Karthikeyan K, Michelle Yuan, Elman Mansimov, Katerina Margatina,, Anurag Pratik, Daniele Bonadiman, Monica Sunkara, Yi Zhang, Yassine Benajiba

PDF

Open Access 1 Video

TL;DR

This paper investigates how search algorithms and self-feedback mechanisms can enhance reasoning in language agents, highlighting the importance of ground-truth or well-designed feedback for effective reasoning across different tasks.

Contribution

It provides a comparative analysis of ground-truth and self-feedback during search and proposes task-specific feedback strategies to improve reasoning in complex tasks.

Findings

01

Self-feedback can limit generalization in reasoning tasks.

02

Ground-truth feedback significantly improves search effectiveness.

03

Task-specific feedback mechanisms are necessary for complex reasoning.

Abstract

Recent works have demonstrated that incorporating search during inference can significantly improve reasoning capabilities of language agents. Some approaches may make use of the ground truth or rely on model's own generated feedback. The search algorithm uses this feedback to then produce values that will update its criterion for exploring and exploiting various reasoning paths. In this study, we investigate how search and model's self-feedback can be leveraged for reasoning tasks. First, we explore differences in ground-truth feedback and self-feedback during search for math reasoning. Second, we observe limitations in applying search techniques to more complex tasks like tool-calling and design domain-specific approaches to address these gaps. Our experiments reveal challenges related to generalization when solely relying on self-feedback during search. For search to work…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

A Study on Leveraging Search and Self-Feedback for Agent Reasoning· underline

Taxonomy

TopicsMulti-Agent Systems and Negotiation · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning