Unveiling and Mitigating Bias in Mental Health Analysis with Large   Language Models

Yuqing Wang; Yun Zhao; Sara Alessandra Keller; Anne de Hond; Marieke; M. van Buchem; Malvika Pillai; Tina Hernandez-Boussard

arXiv:2406.12033·cs.CL·June 21, 2024·3 cites

Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models

Yuqing Wang, Yun Zhao, Sara Alessandra Keller, Anne de Hond, Marieke, M. van Buchem, Malvika Pillai, Tina Hernandez-Boussard

PDF

Open Access 1 Repo

TL;DR

This paper systematically evaluates biases in large language models used for mental health analysis, revealing disparities across social factors and demonstrating that tailored prompts can mitigate these biases effectively.

Contribution

It provides a comprehensive bias evaluation across multiple social factors and proposes fairness-aware prompting methods to reduce bias in mental health predictions.

Findings

01

GPT-4 balances performance and fairness better than other LLMs.

02

Fairness-aware prompts significantly reduce bias in predictions.

03

Domain-specific models like MentalRoBERTa still outperform LLMs in some cases.

Abstract

The advancement of large language models (LLMs) has demonstrated strong capabilities across various applications, including mental health analysis. However, existing studies have focused on predictive performance, leaving the critical issue of fairness underexplored, posing significant risks to vulnerable populations. Despite acknowledging potential biases, previous works have lacked thorough investigations into these biases and their impacts. To address this gap, we systematically evaluate biases across seven social factors (e.g., gender, age, religion) using ten LLMs with different prompting methods on eight diverse mental health datasets. Our results show that GPT-4 achieves the best overall balance in performance and fairness among LLMs, although it still lags behind domain-specific models like MentalRoBERTa in some cases. Additionally, our tailored fairness-aware prompts can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eternityyw/biaseval-llm-mentalhealth
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing

MethodsResidual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Attention Is All You Need · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer