Human-Inspired Context-Selective Multimodal Memory for Social Robots
Hangyeol Kang, Slava Voloshynovskiy, Nadia Magnenat Thalmann

TL;DR
This paper introduces a human-inspired, context-selective multimodal memory system for social robots that enhances personalized, natural interactions by prioritizing emotionally salient and novel experiences.
Contribution
It presents a novel memory architecture that combines multimodal episodic recall with context-based prioritization, inspired by cognitive neuroscience, for improved social robot interactions.
Findings
Achieved a Spearman correlation of 0.506 in memory selectivity, surpassing human consistency.
Improved Recall@1 by up to 13% in multimodal retrieval tasks.
Maintains real-time performance in social scenarios.
Abstract
Memory is fundamental to social interaction, enabling humans to recall meaningful past experiences and adapt their behavior accordingly based on the context. However, most current social robots and embodied agents rely on non-selective, text-based memory, limiting their ability to support personalized, context-aware interactions. Drawing inspiration from cognitive neuroscience, we propose a context-selective, multimodal memory architecture for social robots that captures and retrieves both textual and visual episodic traces, prioritizing moments characterized by high emotional salience or scene novelty. By associating these memories with individual users, our system enables socially personalized recall and more natural, grounded dialogue. We evaluate the selective storage mechanism using a curated dataset of social scenarios, achieving a Spearman correlation of 0.506, surpassing human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
