Your Model Is Not Predicting Depression Well And That Is Why: A Case   Study of PRIMATE Dataset

Kirill Milintsevich (1; 2); Kairit Sirts (2); Ga\"el Dias (1) ((1); University of Caen Normandy; (2) University of Tartu)

arXiv:2403.00438·cs.CL·March 4, 2024·2 cites

Your Model Is Not Predicting Depression Well And That Is Why: A Case Study of PRIMATE Dataset

Kirill Milintsevich (1, 2), Kairit Sirts (2), Ga\"el Dias (1) ((1), University of Caen Normandy, (2) University of Tartu)

PDF

Open Access

TL;DR

This study critically examines the PRIMATE dataset's annotation quality for depression detection, revealing issues with labels and proposing refined annotations by mental health experts to improve NLP model reliability.

Contribution

The paper highlights annotation validity concerns in mental health datasets and introduces expert reannotations for better depression and anhedonia detection.

Findings

01

Identification of false positives in PRIMATE annotations

02

Refined annotations improve dataset quality

03

Expert reannotation enhances model reliability

Abstract

This paper addresses the quality of annotations in mental health datasets used for NLP-based depression level estimation from social media texts. While previous research relies on social media-based datasets annotated with binary categories, i.e. depressed or non-depressed, recent datasets such as D2S and PRIMATE aim for nuanced annotations using PHQ-9 symptoms. However, most of these datasets rely on crowd workers without the domain knowledge for annotation. Focusing on the PRIMATE dataset, our study reveals concerns regarding annotation validity, particularly for the lack of interest or pleasure symptom. Through reannotation by a mental health professional, we introduce finer labels and textual spans as evidence, identifying a notable number of false positives. Our refined annotations, to be released under a Data Use Agreement, offer a higher-quality test set for anhedonia detection.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Digital Mental Health Interventions · Mental Health Research Topics

MethodsSparse Evolutionary Training