Vulnerabilities in Machine Learning-Based Voice Disorder Detection   Systems

Gianpaolo Perelli; Andrea Panzino; Roberto Casula; Marco Micheletto,; Giulia Orr\`u; Gian Luca Marcialis

arXiv:2410.16341·cs.CR·October 23, 2024

Vulnerabilities in Machine Learning-Based Voice Disorder Detection Systems

Gianpaolo Perelli, Andrea Panzino, Roberto Casula, Marco Micheletto,, Giulia Orr\`u, Gian Luca Marcialis

PDF

Open Access

TL;DR

This paper investigates the security vulnerabilities of machine learning systems used for voice disorder detection, demonstrating how various attack methods can compromise their reliability and highlighting the need for improved security measures.

Contribution

It introduces an analysis of attack strategies against voice disorder detection models, revealing their susceptibility and emphasizing the importance of securing healthcare AI systems.

Findings

01

Adversarial and evasion attacks can significantly reduce detection accuracy

02

Pitching attacks can alter classification outcomes

03

Certain attack methods are more effective against current models

Abstract

The impact of voice disorders is becoming more widely acknowledged as a public health issue. Several machine learning-based classifiers with the potential to identify disorders have been used in recent studies to differentiate between normal and pathological voices and sounds. In this paper, we focus on analyzing the vulnerabilities of these systems by exploring the possibility of attacks that can reverse classification and compromise their reliability. Given the critical nature of personal health information, understanding which types of attacks are effective is a necessary first step toward improving the security of such systems. Starting from the original audios, we implement various attack methods, including adversarial, evasion, and pitching techniques, and evaluate how state-of-the-art disorder detection models respond to them. Our findings identify the most effective attack…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis

MethodsFocus