Thought-Retriever: Don't Just Retrieve Raw Data, Retrieve Thoughts for Memory-Augmented Agentic Systems

Tao Feng; Pengrui Han; Guanyu Lin; Ge Liu; Jiaxuan You

arXiv:2604.12231·cs.CL·April 15, 2026

Thought-Retriever: Don't Just Retrieve Raw Data, Retrieve Thoughts for Memory-Augmented Agentic Systems

Tao Feng, Pengrui Han, Guanyu Lin, Ge Liu, Jiaxuan You

PDF

1 Repo 1 Models

TL;DR

Thought-Retriever enables LLMs to utilize an evolving internal thought memory for long-term knowledge integration, surpassing traditional retrieval methods constrained by context length.

Contribution

It introduces a model-agnostic algorithm that leverages intermediate thoughts for long-term memory, enhancing LLM performance on long-context tasks.

Findings

01

Thought-Retriever outperforms state-of-the-art baselines by at least 7.6% in F1 score.

02

It demonstrates the ability to self-evolve through user interactions.

03

The model learns to utilize deeper thoughts for complex queries.

Abstract

Large language models (LLMs) have transformed AI research thanks to their powerful internal capabilities and knowledge. However, existing LLMs still fail to effectively incorporate the massive external knowledge when interacting with the world. Although retrieval-augmented LLMs are proposed to mitigate the issue, they are still fundamentally constrained by the context length of LLMs, as they can only retrieve top-K raw data chunks from the external knowledge base which often consists of millions of data chunks. Here we propose Thought-Retriever, a novel model-agnostic algorithm that helps LLMs generate output conditioned on arbitrarily long external data, without being constrained by the context length or number of retrieved data chunks. Our key insight is to let an LLM fully leverage its intermediate responses generated when solving past user queries (thoughts), filtering meaningless…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ulab-uiuc/Thought-Retriever
github

Models

🤗
star0628/thought-retriever
model

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.