Distilling Feedback into Memory-as-a-Tool

Víctor Gallego

arXiv:2601.05960·cs.CL·March 19, 2026

TL;DR

This paper introduces a framework that converts transient feedback into retrievable guidelines stored in a file-based memory, so that large language models can match test-time refinement pipelines at lower inference cost.

Contribution

The paper presents a novel memory-based approach for integrating feedback into LLM reasoning, reducing inference costs compared to traditional test-time refinement methods.

Findings

01. Achieves performance comparable to test-time refinement at lower inference cost

02. Introduces the Rubric Feedback Bench dataset for evaluating rubric-based learning

03. Demonstrates rapid adaptation of augmented LLMs to feedback

Abstract

We propose a framework that amortizes the cost of inference-time reasoning by converting transient critiques into retrievable guidelines, through a file-based memory system and agent-controlled tool calls. We evaluate this method on the Rubric Feedback Bench, a novel dataset for rubric-based learning. Experiments demonstrate that our augmented LLMs rapidly match the performance of test-time refinement pipelines while drastically reducing inference cost.
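The abstract describes the mechanism only at a high level, so the following is a minimal sketch of what a file-based memory exposed through tool calls could look like, not the paper's implementation. Every name here (`FileMemory`, `write_guideline`, `read_guidelines`, `build_prompt`) and the one-file-per-topic layout are assumptions made for illustration. The idea it shows: a critique is distilled once into a guideline and written to disk, and later tasks read that guideline back instead of paying for another critique-and-refine round.

```python
import tempfile
from pathlib import Path


def _slug(topic: str) -> str:
    """Turn a free-form topic into a safe file stem."""
    return "".join(c if c.isalnum() else "_" for c in topic.strip().lower())


class FileMemory:
    """Hypothetical file-based memory: one text file per topic, one guideline per line."""

    def __init__(self, root: Path) -> None:
        self.root = Path(root)
        self.root.mkdir(parents=True, exist_ok=True)

    def write_guideline(self, topic: str, guideline: str) -> Path:
        """Tool call (assumed): persist a guideline distilled from a critique."""
        path = self.root / f"{_slug(topic)}.txt"
        # Collapse whitespace so each guideline occupies exactly one line.
        line = " ".join(guideline.split())
        with path.open("a", encoding="utf-8") as f:
            f.write(line + "\n")
        return path

    def read_guidelines(self, topic: str) -> list[str]:
        """Tool call (assumed): retrieve stored guidelines for a topic."""
        path = self.root / f"{_slug(topic)}.txt"
        if not path.exists():
            return []
        return [ln for ln in path.read_text(encoding="utf-8").splitlines() if ln]

    def list_topics(self) -> list[str]:
        """Tool call (assumed): let the agent see which topics have guidelines."""
        return sorted(p.stem for p in self.root.glob("*.txt"))


def build_prompt(task: str, guidelines: list[str]) -> str:
    """Prepend retrieved guidelines to the task; return the task alone if none exist."""
    if not guidelines:
        return task
    bullets = "\n".join(f"- {g}" for g in guidelines)
    return f"Guidelines from past feedback:\n{bullets}\n\nTask: {task}"


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        memory = FileMemory(Path(tmp) / "memory")
        # A critique from an earlier episode, stored once as a guideline.
        memory.write_guideline("essay style", "State the thesis in the first sentence.")
        # A later task reuses it without a new critique-and-refine round.
        print(build_prompt("Write an essay on tides.", memory.read_guidelines("essay style")))
```

In this sketch the saving comes from reuse: the critique is paid for once at write time, and each later task pays only for a file read plus a slightly longer prompt. How the paper's agent decides when to write or read is not specified in the abstract and is left out here.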

Code & Models

Datasets

vicgalle/rubric-feedback-bench
dataset · 28 downloads

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)