TL;DR
Thought-Retriever enables LLMs to utilize an evolving internal thought memory for long-term knowledge integration, surpassing traditional retrieval methods constrained by context length.
Contribution
It introduces a model-agnostic algorithm that leverages intermediate thoughts for long-term memory, enhancing LLM performance on long-context tasks.
Findings
Thought-Retriever outperforms state-of-the-art baselines by at least 7.6% in F1 score.
It demonstrates the ability to self-evolve through user interactions.
The model learns to utilize deeper thoughts for complex queries.
Abstract
Large language models (LLMs) have transformed AI research thanks to their powerful internal capabilities and knowledge. However, existing LLMs still fail to effectively incorporate the massive external knowledge when interacting with the world. Although retrieval-augmented LLMs are proposed to mitigate the issue, they are still fundamentally constrained by the context length of LLMs, as they can only retrieve top-K raw data chunks from the external knowledge base which often consists of millions of data chunks. Here we propose Thought-Retriever, a novel model-agnostic algorithm that helps LLMs generate output conditioned on arbitrarily long external data, without being constrained by the context length or number of retrieved data chunks. Our key insight is to let an LLM fully leverage its intermediate responses generated when solving past user queries (thoughts), filtering meaningless…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
