Mental Health Equity in LLMs: Leveraging Multi-Hop Question Answering to Detect Amplified and Silenced Perspectives
Batool Haider, Atmika Gorti, Aman Chadha, Manas Gaur

TL;DR
This paper presents a multi-hop question answering framework to detect and reduce biases in large language models used for mental health, revealing systematic disparities and proposing effective debiasing techniques.
Contribution
It introduces a novel multi-hop QA method to systematically identify intersectional biases in LLMs for mental health and demonstrates effective bias mitigation strategies.
Findings
Systematic bias patterns identified across demographics and mental health conditions.
Multi-hop QA outperforms conventional methods in bias detection.
Debiasing techniques reduced biases by 66-94%.
Abstract
Large Language Models (LLMs) in mental healthcare risk propagating biases that reinforce stigma and harm marginalized groups. While previous research identified concerning trends, systematic methods for detecting intersectional biases remain limited. This work introduces a multi-hop question answering (MHQA) framework to explore LLM response biases in mental health discourse. We analyze content from the Interpretable Mental Health Instruction (IMHI) dataset across symptom presentation, coping mechanisms, and treatment approaches. Using systematic tagging across age, race, gender, and socioeconomic status, we investigate bias patterns at demographic intersections. We evaluate four LLMs: Claude 3.5 Sonnet, Jamba 1.6, Gemma 3, and Llama 4, revealing systematic disparities across sentiment, demographics, and mental health conditions. Our MHQA approach demonstrates superior detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLLaMA
