CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding
Johannes Kirmayr, Lukas Stappen, Phillip Schneider, Florian Matthes,, Elisabeth Andr\'e

TL;DR
This paper introduces CarMem, a long-term memory system for voice assistants that uses category-based preference extraction to improve personalization, privacy, and trust, validated on a synthetic dataset with high accuracy and reduced redundancies.
Contribution
The paper presents a novel category-bounding memory system for voice assistants that enhances preference retention, privacy, and transparency using LLMs, with a new synthetic dataset for benchmarking.
Findings
Preference extraction F1-score of .78 to .95
Reduces redundant preferences by 95%
Achieves retrieval accuracy of .87
Abstract
In today's assistant landscape, personalisation enhances interactions, fosters long-term relationships, and deepens engagement. However, many systems struggle with retaining user preferences, leading to repetitive user requests and disengagement. Furthermore, the unregulated and opaque extraction of user preferences in industry applications raises significant concerns about privacy and trust, especially in regions with stringent regulations like Europe. In response to these challenges, we propose a long-term memory system for voice assistants, structured around predefined categories. This approach leverages Large Language Models to efficiently extract, store, and retrieve preferences within these categories, ensuring both personalisation and transparency. We also introduce a synthetic multi-turn, multi-session conversation dataset (CarMem), grounded in real industry data, tailored to an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems
