ClaritySpeech: Dementia Obfuscation in Speech

Dominika Woszczyk; Ranya Aloufi; Soteris Demetriou

arXiv:2507.09282·cs.CL·July 15, 2025

ClaritySpeech: Dementia Obfuscation in Speech

Dominika Woszczyk, Ranya Aloufi, Soteris Demetriou

PDF

Open Access

TL;DR

ClaritySpeech is a novel framework that obfuscates dementia-affected speech to improve privacy and accessibility by integrating ASR, text obfuscation, and zero-shot TTS without fine-tuning.

Contribution

It introduces a new dementia obfuscation system that preserves speaker identity and enhances speech quality in low-data environments, addressing privacy and accessibility challenges.

Findings

01

Achieves 16% and 10% reduction in F1 score across settings

02

Improves WER from 0.73 to 0.08 and 0.15

03

Enhances speech quality from 1.65 to ~2.15

Abstract

Dementia, a neurodegenerative disease, alters speech patterns, creating communication barriers and raising privacy concerns. Current speech technologies, such as automatic speech transcription (ASR), struggle with dementia and atypical speech, further challenging accessibility. This paper presents a novel dementia obfuscation in speech framework, ClaritySpeech, integrating ASR, text obfuscation, and zero-shot text-to-speech (TTS) to correct dementia-affected speech while preserving speaker identity in low-data environments without fine-tuning. Results show a 16% and 10% drop in mean F1 score across various adversarial settings and modalities (audio, text, fusion) for ADReSS and ADReSSo, respectively, maintaining 50% speaker similarity. We also find that our system improves WER (from 0.73 to 0.08 for ADReSS and 0.15 for ADReSSo) and speech quality from 1.65 to ~2.15, enhancing privacy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems · Robotics and Automated Systems