Detecting Spoof Voices in Asian Non-Native Speech: An Indonesian and   Thai Case Study

Aulia Adila; Candy Olivia Mawalim; Masashi Unoki

arXiv:2412.01040·eess.AS·December 3, 2024·APSIPA

Detecting Spoof Voices in Asian Non-Native Speech: An Indonesian and Thai Case Study

Aulia Adila, Candy Olivia Mawalim, Masashi Unoki

PDF

Open Access

TL;DR

This paper develops spoofing detection systems for non-native Indonesian and Thai speech, highlighting the challenges and benefits of including non-native data to improve detection accuracy in diverse linguistic contexts.

Contribution

Introduces a dataset and machine learning-based spoofing detection methods tailored for non-native speech, emphasizing the importance of non-native data in training.

Findings

01

Native CM struggles with non-native speech

02

Incorporating non-native data improves detection performance

03

Proposed features and classifiers enhance spoofing detection

Abstract

This study focuses on building effective spoofing countermeasures (CMs) for non-native speech, specifically targeting Indonesian and Thai speakers. We constructed a dataset comprising both native and non-native speech to facilitate our research. Three key features (MFCC, LFCC, and CQCC) were extracted from the speech data, and three classic machine learning-based classifiers (CatBoost, XGBoost, and GMM) were employed to develop robust spoofing detection systems using the native and combined (native and non-native) speech data. This resulted in two types of CMs: Native and Combined. The performance of these CMs was evaluated on both native and non-native speech datasets. Our findings reveal significant challenges faced by Native CM in handling non-native speech, highlighting the necessity for domain-specific solutions. The proposed method shows improved detection capabilities,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSwearing, Euphemism, Multilingualism · Hate Speech and Cyberbullying Detection