Defend Data Poisoning Attacks on Voice Authentication

Ke Li; Cameron Baird; Dan Lin

arXiv:2209.04547·cs.CR·July 11, 2023

Defend Data Poisoning Attacks on Voice Authentication

Ke Li, Cameron Baird, Dan Lin

PDF

Open Access

TL;DR

This paper demonstrates a simple data poisoning attack on voice authentication systems and proposes Guardian, a CNN-based discriminator, that effectively detects attacked accounts with high accuracy.

Contribution

It introduces a novel data poisoning attack on voice authentication and a robust defense method called Guardian that significantly improves detection accuracy.

Findings

01

Guardian detects about 95% of attacked accounts

02

Existing defenses only achieve around 60% accuracy

03

The attack is easy to implement and hard to defend against

Abstract

With the advances in deep learning, speaker verification has achieved very high accuracy and is gaining popularity as a type of biometric authentication option in many scenes of our daily life, especially the growing market of web services. Compared to traditional passwords, "vocal passwords" are much more convenient as they relieve people from memorizing different passwords. However, new machine learning attacks are putting these voice authentication systems at risk. Without a strong security guarantee, attackers could access legitimate users' web accounts by fooling the deep neural network (DNN) based voice recognition models. In this paper, we demonstrate an easy-to-implement data poisoning attack to the voice authentication system, which can hardly be captured by existing defense mechanisms. Thus, we propose a more robust defense method, called Guardian, which is a convolutional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Hate Speech and Cyberbullying Detection · Natural Language Processing Techniques