Mitigating Semantic Drift: Evaluating LLMs' Efficacy in Psychotherapy through MI Dialogue Summarization

Vivek Kumar; Pushpraj Singh Rajawat; Eirini Ntoutsi

arXiv:2511.22818 · cs.CL · December 1, 2025

TL;DR

This paper evaluates how accurately large language models (LLMs) summarize motivational interviewing (MI) dialogues in psychotherapy, addressing challenges such as semantic drift and offering guidance for using LLMs effectively in sensitive, low-resource domains.

Contribution

The paper introduces a novel evaluation framework built on MI dialogue summaries and a two-stage annotation scheme based on the Motivational Interviewing Treatment Integrity (MITI) framework, along with a high-quality dataset for low-resource psychological domains.

Findings

01. LLMs show varying capacity to capture psychological constructs

02. Prompting techniques influence model performance

03. Best practices can mitigate semantic drift in therapy contexts

Abstract

Recent advancements in large language models (LLMs) have shown their potential across both general and domain-specific tasks. However, there is a growing concern regarding their lack of sensitivity, factual incorrectness in responses, inconsistent expressions of empathy, bias, hallucinations, and overall inability to capture the depth and complexity of human understanding, especially in low-resource and sensitive domains such as psychology. To address these challenges, our study employs a mixed-methods approach to evaluate the efficacy of LLMs in psychotherapy. We use LLMs to generate precise summaries of motivational interviewing (MI) dialogues and design a two-stage annotation scheme based on key components of the Motivational Interviewing Treatment Integrity (MITI) framework, namely evocation, collaboration, autonomy, direction, empathy, and a non-judgmental attitude. Using…

Taxonomy

Topics: Mental Health via Writing · Topic Modeling · Digital Mental Health Interventions