Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic

Saad Mankarious; Aya Zirikly

arXiv:2601.14124·cs.CL·January 21, 2026

Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic

Saad Mankarious, Aya Zirikly

PDF

Open Access

TL;DR

This paper introduces a diffusion model approach for synthetic Arabic mental health text generation that mitigates gender bias by performing style transfer without pretrained language models, enhancing diversity and fairness.

Contribution

It presents a novel, pretraining-free diffusion-based style transfer method for bias mitigation in low-resource mental health text data, specifically targeting gender imbalance in Arabic.

Findings

01

High semantic fidelity in generated text

02

Effective gender style transfer demonstrated

03

Mitigates bias without pretrained LLMs

Abstract

Synthetic data offers a promising solution for mitigating data scarcity and demographic bias in mental health analysis, yet existing approaches largely rely on pretrained large language models (LLMs), which may suffer from limited output diversity and propagate biases inherited from their training data. In this work, we propose a pretraining-free diffusion-based approach for synthetic text generation that frames bias mitigation as a style transfer problem. Using the CARMA Arabic mental health corpus, which exhibits a substantial gender imbalance, we focus on male-to-female style transfer to augment underrepresented female-authored content. We construct five datasets capturing varying linguistic and semantic aspects of gender expression in Arabic and train separate diffusion models for each setting. Quantitative evaluations demonstrate consistently high semantic fidelity between source…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Computational and Text Analysis Methods · Digital Mental Health Interventions