SocialMemBench: Are AI Memory Systems Ready for Social Group Settings?
Olukunle Owolabi

TL;DR
SocialMemBench is a new benchmark designed to evaluate AI memory systems' ability to handle multi-party social group settings, revealing significant gaps in current systems' capabilities.
Contribution
The paper introduces SocialMemBench, a comprehensive benchmark for multi-party social group memory, and evaluates existing AI memory frameworks, highlighting their limitations.
Findings
Current memory systems cluster in the 0.12-0.18 range, below references of 0.345 and 0.369.
A full-context Gemini 2.5 Flash reaches only 0.721 on small networks, indicating difficulty.
Benchmark reveals measurable gaps in current AI memory systems for social group settings.
Abstract
Memory systems for AI assistants were built for single-user dialogue and fail characteristically when applied to multi-party social group settings. This gap matters for the social assistants being built today: group-acting agents embedded in chat platforms, and proactive personal-assistant agents whose holistic model of a user must include their social context. Existing memory benchmarks evaluate dyadic or workplace dialogue; none targets multi-party social groups, where memory must anchor facts in shared history rather than professional roles, separate group norms from individual exceptions, and correctly attribute even after member departure. We introduce SocialMemBench, a benchmark of human-verified synthetic social group networks across five archetypes (close friends, family, recreational, interest community, acquaintance network) and three group-size tiers (4-30 members), with 430…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
