Prosody-Driven Privacy-Preserving Dementia Detection
Dominika Woszczyk, Ranya Aloufi, Soteris Demetriou

TL;DR
This paper introduces a novel method to anonymize speaker embeddings for dementia detection, effectively balancing privacy preservation with diagnostic accuracy without relying on adversarial learning or extensive classifiers.
Contribution
A new domain knowledge-based approach to disentangle prosody features, enhancing privacy while maintaining high dementia detection performance.
Findings
Speaker recognition F1-score reduced to 0.01%
Dementia detection F1-score maintained at 74%
No impact on speech naturalness
Abstract
Speaker embeddings extracted from voice recordings have been proven valuable for dementia detection. However, by their nature, these embeddings contain identifiable information which raises privacy concerns. In this work, we aim to anonymize embeddings while preserving the diagnostic utility for dementia detection. Previous studies rely on adversarial learning and models trained on the target attribute and struggle in limited-resource settings. We propose a novel approach that leverages domain knowledge to disentangle prosody features relevant to dementia from speaker embeddings without relying on a dementia classifier. Our experiments show the effectiveness of our approach in preserving speaker privacy (speaker recognition F1-score .01%) while maintaining high dementia detection score F1-score of 74% on the ADReSS dataset. Our results are also on par with a more constrained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUser Authentication and Security Systems · Emotion and Mood Recognition · Digital Mental Health Interventions
