Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental   Health

Vivek Kumar; Eirini Ntoutsi; Pushpraj Singh Rajawat; Giacomo Medda and; Diego Reforgiato Recupero

arXiv:2412.12981·cs.CL·December 18, 2024

Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health

Vivek Kumar, Eirini Ntoutsi, Pushpraj Singh Rajawat, Giacomo Medda and, Diego Reforgiato Recupero

PDF

Open Access 1 Repo

TL;DR

This paper introduces IC-AnnoMI, a new expert-annotated dataset for motivational interviewing dialogues generated with LLMs, addressing challenges like bias and hallucinations in sensitive healthcare applications.

Contribution

The work presents a novel dataset and evaluation framework for LLMs in mental health, incorporating expert annotations and bias mitigation strategies in conversational therapy contexts.

Findings

01

IC-AnnoMI enhances LLM training with expert-annotated dialogues.

02

Prompting strategies influence bias reduction in LLM-generated therapy dialogues.

03

Augmented data improves LLM performance and reduces bias in sensitive domains.

Abstract

Large language models (LLMs) have shown promising capabilities in healthcare analysis but face several challenges like hallucinations, parroting, and bias manifestation. These challenges are exacerbated in complex, sensitive, and low-resource domains. Therefore, in this work we introduce IC-AnnoMI, an expert-annotated motivational interviewing (MI) dataset built upon AnnoMI by generating in-context conversational dialogues leveraging LLMs, particularly ChatGPT. IC-AnnoMI employs targeted prompts accurately engineered through cues and tailored information, taking into account therapy style (empathy, reflection), contextual relevance, and false semantic change. Subsequently, the dialogues are annotated by experts, strictly adhering to the Motivational Interviewing Skills Code (MISC), focusing on both the psychological and linguistic dimensions of MI dialogues. We comprehensively evaluate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vsrana-ai/IC-AnnoMI
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Medical Coding and Health Information · Chronic Disease Management Strategies