Controlling Whisper: Universal Acoustic Adversarial Attacks to Control   Speech Foundation Models

Vyas Raina; Mark Gales

arXiv:2407.04482·cs.SD·October 14, 2024·1 cites

Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models

Vyas Raina, Mark Gales

PDF

Open Access 1 Repo

TL;DR

This paper reveals that speech foundation models like Whisper are vulnerable to universal adversarial acoustic segments that can manipulate their task output without model prompt access, highlighting security concerns.

Contribution

It introduces a novel universal acoustic adversarial attack method that can control speech foundation models' behavior regardless of prompts, exposing new security vulnerabilities.

Findings

01

Universal acoustic segments can override Whisper's task setting.

02

Adversarial segments enable control over speech translation vs. transcription.

03

The attack works without prompt access, demonstrating a significant security risk.

Abstract

Speech enabled foundation models, either in the form of flexible speech recognition based systems or audio-prompted large language models (LLMs), are becoming increasingly popular. One of the interesting aspects of these models is their ability to perform tasks other than automatic speech recognition (ASR) using an appropriate prompt. For example, the OpenAI Whisper model can perform both speech transcription and speech translation. With the development of audio-prompted LLMs there is the potential for even greater control options. In this work we demonstrate that with this greater flexibility the systems can be susceptible to model-control adversarial attacks. Without any access to the model prompt it is possible to modify the behaviour of the system by appropriately changing the audio input. To illustrate this risk, we demonstrate that it is possible to prepend a short universal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rainavyas/prepend_acoustic_attack
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning

MethodsSparse Evolutionary Training