Defending against Adversarial Audio via Diffusion Model
Shutong Wu, Jiongxiao Wang, Wei Ping, Weili Nie, Chaowei Xiao

TL;DR
This paper introduces AudioPure, a diffusion model-based defense that purifies adversarial audio, significantly improving robustness of acoustic systems against various attacks without retraining.
Contribution
Proposes a plug-and-play diffusion model approach for adversarial audio purification, enhancing robustness without fine-tuning or retraining of classifiers.
Findings
Outperforms existing defenses under strong adaptive attacks
Achieves up to +20% in robust accuracy
Provides higher certified robustness via randomized smoothing
Abstract
Deep learning models have been widely used in commercial acoustic systems in recent years. However, adversarial audio examples can cause abnormal behaviors for those acoustic systems, while being hard for humans to perceive. Various methods, such as transformation-based defenses and adversarial training, have been proposed to protect acoustic systems from adversarial attacks, but they are less effective against adaptive attacks. Furthermore, directly applying the methods from the image domain can lead to suboptimal results because of the unique properties of audio data. In this paper, we propose an adversarial purification-based defense pipeline, AudioPure, for acoustic systems via off-the-shelf diffusion models. Taking advantage of the strong generation ability of diffusion models, AudioPure first adds a small amount of noise to the adversarial audio and then runs the reverse sampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Speech Recognition and Synthesis · Music and Audio Processing
MethodsDiffusion
