Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health
Vivek Kumar, Eirini Ntoutsi, Pushpraj Singh Rajawat, Giacomo Medda and, Diego Reforgiato Recupero

TL;DR
This paper introduces IC-AnnoMI, a new expert-annotated dataset for motivational interviewing dialogues generated with LLMs, addressing challenges like bias and hallucinations in sensitive healthcare applications.
Contribution
The work presents a novel dataset and evaluation framework for LLMs in mental health, incorporating expert annotations and bias mitigation strategies in conversational therapy contexts.
Findings
IC-AnnoMI enhances LLM training with expert-annotated dialogues.
Prompting strategies influence bias reduction in LLM-generated therapy dialogues.
Augmented data improves LLM performance and reduces bias in sensitive domains.
Abstract
Large language models (LLMs) have shown promising capabilities in healthcare analysis but face several challenges like hallucinations, parroting, and bias manifestation. These challenges are exacerbated in complex, sensitive, and low-resource domains. Therefore, in this work we introduce IC-AnnoMI, an expert-annotated motivational interviewing (MI) dataset built upon AnnoMI by generating in-context conversational dialogues leveraging LLMs, particularly ChatGPT. IC-AnnoMI employs targeted prompts accurately engineered through cues and tailored information, taking into account therapy style (empathy, reflection), contextual relevance, and false semantic change. Subsequently, the dialogues are annotated by experts, strictly adhering to the Motivational Interviewing Skills Code (MISC), focusing on both the psychological and linguistic dimensions of MI dialogues. We comprehensively evaluate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare · Medical Coding and Health Information · Chronic Disease Management Strategies
