Learning to Reason from Feedback at Test-Time

Yanyang Li; Michael Lyu; Liwei Wang

arXiv:2502.15771·cs.LG·May 30, 2025

Learning to Reason from Feedback at Test-Time

Yanyang Li, Michael Lyu, Liwei Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces FTTT and OpTune, a new framework and optimizer for better feedback utilization in large language models during test-time reasoning, improving scalability and performance.

Contribution

It presents a novel test-time feedback optimization paradigm and a learnable optimizer, addressing limitations of existing methods in feedback utilization for LLMs.

Findings

01

FTTT and OpTune outperform existing methods in reasoning tasks.

02

Enhanced scalability and accuracy demonstrated across multiple datasets.

03

Effective feedback exploitation improves test-time reasoning performance.

Abstract

Solving complex tasks in a single attempt is challenging for large language models (LLMs). Iterative interaction with the environment and feedback is often required to achieve success, making effective feedback utilization a critical topic. Existing approaches either struggle with length generalization or rely on naive retries without leveraging prior information. In this paper, we introduce FTTT, a novel paradigm that formulates feedback utilization as an optimization problem at test time. Additionally, we propose a learnable test-time optimizer, OpTune, to effectively exploit feedback. Experiments on two LLMs across four reasoning datasets demonstrate that FTTT and OpTune achieve superior scalability and performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lavi-lab/fttt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques