Toward Improving Synthetic Audio Spoofing Detection Robustness via Meta-Learning and Disentangled Training With Adversarial Examples
Zhenyu Wang, John H.L. Hansen

TL;DR
This paper introduces a robust spoofing detection system for automatic speaker verification that leverages meta-learning, adversarial data augmentation, and attention mechanisms to improve detection of unseen spoofing attacks.
Contribution
It proposes a novel combination of weighted loss, meta-learning, adversarial examples, and attention modules to enhance spoofing detection robustness against unseen attacks.
Findings
Achieved a pooled EER of 0.87% on ASVspoof 2019 LA dataset.
Reduced min t-DCF to 0.0277, indicating improved detection performance.
Demonstrated effectiveness of adversarial augmentation and meta-learning in spoofing detection.
Abstract
Advances in automatic speaker verification (ASV) promote research into the formulation of spoofing detection systems for real-world applications. The performance of ASV systems can be degraded severely by multiple types of spoofing attacks, namely, synthetic speech (SS), voice conversion (VC), replay, twins and impersonation, especially in the case of unseen synthetic spoofing attacks. A reliable and robust spoofing detection system can act as a security gate to filter out spoofing attacks instead of having them reach the ASV system. A weighted additive angular margin loss is proposed to address the data imbalance issue, and different margins has been assigned to improve generalization to unseen spoofing attacks in this study. Meanwhile, we incorporate a meta-learning loss function to optimize differences between the embeddings of support versus query set in order to learn a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Softmax · Attention Is All You Need · Sparse Evolutionary Training · Convolution · Batch Normalization · Residual Connection · Residual Block · Auxiliary Batch Normalization
