MemLoRA: Distilling Expert Adapters for On-Device Memory Systems
Massimo Bini, Ondrej Bohdal, Umberto Michieli, Zeynep Akata, Mete Ozay, Taha Ceritli

TL;DR
MemLoRA introduces a memory system with adapters for small language models, enabling efficient on-device memory operations and multimodal visual understanding, outperforming larger models in text and visual tasks.
Contribution
The paper presents MemLoRA, a novel memory system with adapters for small models, and extends it with MemLoRA-V for visual understanding, enabling on-device multimodal memory operations.
Findings
Outperforms larger models on text-only memory tasks.
Significantly improves visual question answering accuracy.
Maintains strong performance in text tasks while enabling visual reasoning.
Abstract
Memory-augmented Large Language Models (LLMs) have demonstrated remarkable consistency during prolonged dialogues by storing relevant memories and incorporating them as context. Such memory-based personalization is also key in on-device settings that allow users to keep their conversations and data private. However, memory-augmented systems typically rely on LLMs that are too costly for local on-device deployment. Even though Small Language Models (SLMs) are more suitable for on-device inference than LLMs, they cannot achieve sufficient performance. Additionally, these LLM-based systems lack native visual capabilities, limiting their applicability in multimodal contexts. In this paper, we introduce (i) MemLoRA, a novel memory system that enables local deployment by equipping SLMs with specialized memory adapters, and (ii) its vision extension MemLoRA-V, which integrates small…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Advanced Neural Network Applications
