Attentive activation function for improving end-to-end spoofing   countermeasure systems

Woo Hyun Kang; Jahangir Alam; Abderrahim Fathan

arXiv:2205.01528·eess.AS·May 4, 2022·1 cites

Attentive activation function for improving end-to-end spoofing countermeasure systems

Woo Hyun Kang, Jahangir Alam, Abderrahim Fathan

PDF

Open Access

TL;DR

This paper introduces an attention-based activation function called AReLU to enhance end-to-end spoofing detection systems by focusing on artifact-related features, demonstrating improved performance on the ASVSpoof2019 dataset.

Contribution

The paper proposes a novel attention rectified linear unit (AReLU) activation function for spoofing detection, improving feature focus and system accuracy over traditional activation functions.

Findings

01

AReLU improves detection accuracy on the ASVSpoof2019 dataset.

02

The attention mechanism enhances relevant feature contribution.

03

The proposed method outperforms standard activation functions.

Abstract

The main objective of the spoofing countermeasure system is to detect the artifacts within the input speech caused by the speech synthesis or voice conversion process. In order to achieve this, we propose to adopt an attentive activation function, more specifically attention rectified linear unit (AReLU) to the end-to-end spoofing countermeasure system. Since the AReLU employs the attention mechanism to boost the contribution of relevant input features while suppressing the irrelevant ones, introducing AReLU can help the countermeasure system to focus on the features related to the artifacts. The proposed framework was experimented on the logical access (LA) task of ASVSpoof2019 dataset, and outperformed the systems using the standard non-learnable activation functions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Phonetics and Phonology Research