Auditing Support Strategies in LLMs through Grounded Multi-Turn Social Simulation
Michelle Star, Andrew Aquilina, Yu-Ru Lin

TL;DR
This paper presents a multi-turn social simulation framework to evaluate how large language models provide social support, revealing dynamics invisible to single-turn assessments.
Contribution
It introduces a novel multi-turn simulation approach and uses SSBC coding to analyze support strategies and their relation to user distress in LLMs.
Findings
Support strategies decline as estimated distress increases.
Support composition shifts systematically with user distress across models.
Community context influences support behavior more than demographic factors.
Abstract
When users seek social support from chatbots, they disclose their situation gradually, yet most evaluations of supportive LLMs rely on single-turn, fully specified prompts. We introduce a multi-turn simulation framework that closes this gap. Support-seeking narratives from five Reddit communities are decomposed into ordered fragments and revealed turn by turn to a language model. Each response is coded with the Social Support Behavior Code (SSBC), an established multi-label taxonomy that captures the composition of support, rather than a single quality score. To ask whether support choices track the model's own construal of user distress, we use linear probes on hidden representations to estimate this internal signal without altering the generation context. Across two mid-scale models (Llama-3.1-8B, OLMo-3-7B) and more than 6,200 turns, support composition shifts systematically with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
