TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA
Mengwei Yuan, Jianan Liu, Jing Yang, Xianyou Li, Weiran Yan, Yichao Wu, Penghao Liang

TL;DR
TA-Mem introduces an adaptive, tool-augmented memory retrieval framework for LLMs, enhancing long-term conversational QA by dynamically extracting and utilizing structured memories, leading to improved performance and flexibility.
Contribution
The paper presents a novel framework combining adaptive memory extraction and multi-tool retrieval for LLMs, addressing limitations of static retrieval methods.
Findings
Significant performance improvements on LoCoMo dataset.
Effective adaptive memory extraction and retrieval demonstrated.
Enhanced flexibility in handling diverse question types.
Abstract
Large Language Model (LLM) has exhibited strong reasoning ability in text-based contexts across various domains, yet the limitation of context window poses challenges for the model on long-range inference tasks and necessitates a memory storage system. While many current storage approaches have been proposed with episodic notes and graph representations of memory, retrieval methods still primarily rely on predefined workflows or static similarity top-k over embeddings. To address this inflexibility, we introduced a novel tool-augmented autonomous memory retrieval framework (TA-Mem), which contains: (1) a memory extraction LLM agent which is prompted to adaptively chuck an input into sub-context based on semantic correlation, and extract information into structured notes, (2) a multi-indexed memory database designed for different types of query methods including both key-based lookup and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Multimodal Machine Learning Applications
