A-MBER: Affective Memory Benchmark for Emotion Recognition
Deliang Wen, Ke Sun, Yu Wang

TL;DR
A-MBER is a new benchmark designed to evaluate AI models' ability to interpret current emotional states based on remembered multi-session interaction history, addressing a gap in existing emotion recognition resources.
Contribution
It introduces a comprehensive benchmark with explicit intermediate steps to assess models' use of long-term memory for emotion recognition in multi-session interactions.
Findings
A-MBER effectively discriminates models based on their memory and reasoning capabilities.
Memory enhances affective interpretation through selective and context-sensitive use of past interactions.
Results highlight the importance of structured memory and grounded reasoning in emotion recognition.
Abstract
AI assistants that interact with users over time need to interpret the user's current emotional state in order to respond appropriately and personally. However, this capability remains insufficiently evaluated. Existing emotion datasets mainly assess local or instantaneous affect, while long-term memory benchmarks focus largely on factual recall, temporal consistency, or knowledge updating. As a result, current resources provide limited support for testing whether a model can use remembered interaction history to interpret a user's present affective state. We introduce A-MBER, an Affective Memory Benchmark for Emotion Recognition, to evaluate this capability. A-MBER focuses on present affective interpretation grounded in remembered multi-session interaction history. Given an interaction trajectory and a designated anchor turn, a model must infer the user's current affective state,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
