CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through   Category-Bounding

Johannes Kirmayr; Lukas Stappen; Phillip Schneider; Florian Matthes,; Elisabeth Andr\'e

arXiv:2501.09645·cs.AI·January 17, 2025

CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding

Johannes Kirmayr, Lukas Stappen, Phillip Schneider, Florian Matthes,, Elisabeth Andr\'e

PDF

Open Access 1 Repo

TL;DR

This paper introduces CarMem, a long-term memory system for voice assistants that uses category-based preference extraction to improve personalization, privacy, and trust, validated on a synthetic dataset with high accuracy and reduced redundancies.

Contribution

The paper presents a novel category-bounding memory system for voice assistants that enhances preference retention, privacy, and transparency using LLMs, with a new synthetic dataset for benchmarking.

Findings

01

Preference extraction F1-score of .78 to .95

02

Reduces redundant preferences by 95%

03

Achieves retrieval accuracy of .87

Abstract

In today's assistant landscape, personalisation enhances interactions, fosters long-term relationships, and deepens engagement. However, many systems struggle with retaining user preferences, leading to repetitive user requests and disengagement. Furthermore, the unregulated and opaque extraction of user preferences in industry applications raises significant concerns about privacy and trust, especially in regions with stringent regulations like Europe. In response to these challenges, we propose a long-term memory system for voice assistants, structured around predefined categories. This approach leverages Large Language Models to efficiently extract, store, and retrieve preferences within these categories, ensuring both personalisation and transparency. We also introduce a synthetic multi-turn, multi-session conversation dataset (CarMem), grounded in real industry data, tailored to an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

johanneskirmayr/carmem
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems