Can Emotion Fool Anti-spoofing?

Aurosweta Mahapatra; Ismail Rasim Ulgen; Abinay Reddy Naini; Carlos Busso; Berrak Sisman

arXiv:2505.23962·eess.AS·June 2, 2025

Can Emotion Fool Anti-spoofing?

Aurosweta Mahapatra, Ismail Rasim Ulgen, Abinay Reddy Naini, Carlos Busso, Berrak Sisman

PDF

Open Access

TL;DR

This paper introduces EmoSpoof-TTS, a new emotional speech dataset, and GEM, a gated ensemble model that enhances anti-spoofing robustness against emotionally expressive synthetic speech, revealing current models' vulnerabilities.

Contribution

It provides the EmoSpoof-TTS dataset and proposes GEM, a novel emotion-aware anti-spoofing model, addressing the gap in robustness against emotional synthetic speech.

Findings

01

Existing models struggle with emotional speech

02

GEM improves detection across all emotions

03

Emotional data impacts anti-spoofing performance

Abstract

Traditional anti-spoofing focuses on models and datasets built on synthetic speech with mostly neutral state, neglecting diverse emotional variations. As a result, their robustness against high-quality, emotionally expressive synthetic speech is uncertain. We address this by introducing EmoSpoof-TTS, a corpus of emotional text-to-speech samples. Our analysis shows existing anti-spoofing models struggle with emotional synthetic speech, exposing risks of emotion-targeted attacks. Even trained on emotional data, the models underperform due to limited focus on emotional aspect and show performance disparities across emotions. This highlights the need for emotion-focused anti-spoofing paradigm in both dataset and methodology. We propose GEM, a gated ensemble of emotion-specialized models with a speech emotion recognition gating network. GEM performs effectively across all emotions and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCybersecurity and Cyber Warfare Studies · Freedom of Expression and Defamation

MethodsFocus