Fixed-Persona SLMs with Modular Memory: Scalable NPC Dialogue on Consumer Hardware
Martin Braas, Lukas Esterle

TL;DR
This paper introduces a modular NPC dialogue system using small language models with runtime-swappable memory modules, enabling scalable, expressive, and long-term interactions on consumer hardware for gaming and beyond.
Contribution
It presents a novel modular architecture combining fine-tuned small language models with dynamic memory modules for scalable, persona-specific NPC dialogues without retraining.
Findings
Effective long-term memory retention in NPCs.
Compatibility with multiple open-source SLMs on consumer hardware.
Potential applications beyond gaming in various conversational domains.
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities in generating human-like text, yet their applicability to dialogue systems in computer games remains limited. This limitation arises from their substantial hardware requirements, latency constraints, and the necessity to maintain clearly defined knowledge boundaries within a game setting. In this paper, we propose a modular NPC dialogue system that leverages Small Language Models (SLMs), fine-tuned to encode specific NPC personas and integrated with runtime-swappable memory modules. These memory modules preserve character-specific conversational context and world knowledge, enabling expressive interactions and long-term memory without retraining or model reloading during gameplay. We comprehensively evaluate our system using three open-source SLMs: DistilGPT-2, TinyLlama-1.1B-Chat, and Mistral-7B-Instruct, trained on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPersona Design and Applications · AI in Service Interactions · Topic Modeling
