Mitigating Semantic Drift: Evaluating LLMs' Efficacy in Psychotherapy through MI Dialogue Summarization
Vivek Kumar, Pushpraj Singh Rajawat, Eirini Ntoutsi

TL;DR
This paper evaluates large language models' ability to accurately summarize motivational interviewing dialogues in psychotherapy, addressing challenges like semantic drift and providing insights for their effective use in sensitive, low-resource domains.
Contribution
It introduces a novel evaluation framework using MI dialogue summaries and a multi-stage annotation scheme based on the MITI framework, along with a high-quality dataset for low-resource psychological domains.
Findings
LLMs show varying capacity to capture psychological constructs
Prompting techniques influence model performance
Best practices can mitigate semantic drift in therapy contexts
Abstract
Recent advancements in large language models (LLMs) have shown their potential across both general and domain-specific tasks. However, there is a growing concern regarding their lack of sensitivity, factual incorrectness in responses, inconsistent expressions of empathy, bias, hallucinations, and overall inability to capture the depth and complexity of human understanding, especially in low-resource and sensitive domains such as psychology. To address these challenges, our study employs a mixed-methods approach to evaluate the efficacy of LLMs in psychotherapy. We use LLMs to generate precise summaries of motivational interviewing (MI) dialogues and design a two-stage annotation scheme based on key components of the Motivational Interviewing Treatment Integrity (MITI) framework, namely evocation, collaboration, autonomy, direction, empathy, and a non-judgmental attitude. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Topic Modeling · Digital Mental Health Interventions
