Underneath the Numbers: Quantitative and Qualitative Gender Fairness in   LLMs for Depression Prediction

Micol Spitale; Jiaee Cheong; Hatice Gunes

arXiv:2406.08183·cs.CL·June 17, 2024·3 cites

Underneath the Numbers: Quantitative and Qualitative Gender Fairness in LLMs for Depression Prediction

Micol Spitale, Jiaee Cheong, Hatice Gunes

PDF

Open Access

TL;DR

This study investigates gender bias in large language models used for depression prediction, comparing quantitative and qualitative fairness, and proposing new methods for qualitative bias assessment.

Contribution

It is the first to evaluate gender bias in LLMs for depression detection using both quantitative and qualitative approaches, and introduces strategies for qualitative fairness analysis.

Findings

01

ChatGPT performs best across various metrics

02

LLaMA 2 outperforms others in group fairness

03

Qualitative analysis reveals themes and explanations used by LLMs

Abstract

Recent studies show bias in many machine learning models for depression detection, but bias in LLMs for this task remains unexplored. This work presents the first attempt to investigate the degree of gender bias present in existing LLMs (ChatGPT, LLaMA 2, and Bard) using both quantitative and qualitative approaches. From our quantitative evaluation, we found that ChatGPT performs the best across various performance metrics and LLaMA 2 outperforms other LLMs in terms of group fairness metrics. As qualitative fairness evaluation remains an open research question we propose several strategies (e.g., word count, thematic analysis) to investigate whether and how a qualitative evaluation can provide valuable insights for bias analysis beyond what is possible with quantitative evaluation. We found that ChatGPT consistently provides a more comprehensive, well-reasoned explanation for its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health Research Topics · Sex and Gender in Healthcare · Digital Mental Health Interventions

MethodsLLaMA