Automated Data Enrichment using Confidence-Aware Fine-Grained Debate among Open-Source LLMs for Mental Health and Online Safety

Junyu Mao; Anthony Hills; Talia Tseriotou; Maria Liakata; Aya Shamir; Dan Sayda; Dana Atzil-Slonim; Natalie Djohari; Arpan Mandal; Silke Roth; Pamela Ugwudike; Mahesan Niranjan; Stuart E. Middleton

arXiv:2512.06227·cs.CL·March 4, 2026

TL;DR

This paper introduces a Confidence-Aware Fine-Grained Debate framework that enhances automated multi-label data annotation for mental health and online safety, leveraging collaborative debate among LLMs to improve downstream NLP tasks.

Contribution

The paper presents a novel debate-based annotation framework and new datasets, improving multi-label data enrichment and downstream task performance in sensitive NLP applications.

Findings

01. CFD outperforms baseline annotation methods in robustness.

02. Enriched features via debate transcripts improve task accuracy by 9.9%.

03. New datasets support mental health and online safety research.

Abstract

Real-world indicators play an important role in many natural language processing (NLP) applications, such as life events for mental health analysis and risky behaviour for online safety, yet labelling such information in training datasets is often costly and/or difficult due to the dynamic nature of these indicators. Large language models (LLMs) show promising potential for automated annotation, yet multi-label prediction remains challenging. In this work, we propose a Confidence-Aware Fine-Grained Debate (CFD) framework that simulates collaborative annotation using fine-grained information to better support automated multi-label enrichment. We introduce two new expert-annotated resources: a mental health Reddit well-being dataset and an online safety Facebook sharenting risk dataset. Experiments show that CFD achieves the most robust enrichment performance compared to a range of baseline approaches. We…

Taxonomy

Topics: Mental Health via Writing · Topic Modeling · Digital Mental Health Interventions