SocialMemBench: Are AI Memory Systems Ready for Social Group Settings?

Olukunle Owolabi

arXiv:2605.17789·cs.CL·May 19, 2026

SocialMemBench: Are AI Memory Systems Ready for Social Group Settings?

Olukunle Owolabi

PDF

TL;DR

SocialMemBench is a new benchmark designed to evaluate AI memory systems' ability to handle multi-party social group settings, revealing significant gaps in current systems' capabilities.

Contribution

The paper introduces SocialMemBench, a comprehensive benchmark for multi-party social group memory, and evaluates existing AI memory frameworks, highlighting their limitations.

Findings

01

Current memory systems cluster in the 0.12-0.18 range, below references of 0.345 and 0.369.

02

A full-context Gemini 2.5 Flash reaches only 0.721 on small networks, indicating difficulty.

03

Benchmark reveals measurable gaps in current AI memory systems for social group settings.

Abstract

Memory systems for AI assistants were built for single-user dialogue and fail characteristically when applied to multi-party social group settings. This gap matters for the social assistants being built today: group-acting agents embedded in chat platforms, and proactive personal-assistant agents whose holistic model of a user must include their social context. Existing memory benchmarks evaluate dyadic or workplace dialogue; none targets multi-party social groups, where memory must anchor facts in shared history rather than professional roles, separate group norms from individual exceptions, and correctly attribute even after member departure. We introduce SocialMemBench, a benchmark of human-verified synthetic social group networks across five archetypes (close friends, family, recreational, interest community, acquaintance network) and three group-size tiers (4-30 members), with 430…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.