Adversarial Training Improves Generalization Under Distribution Shifts in Bioacoustics
Ren\'e Heinrich, Lukas Rauch, Bernhard Sick, Christoph Scholz

TL;DR
This paper demonstrates that adversarial training, especially with output-space attacks, enhances both generalization and robustness of audio classification models under real-world distribution shifts, using bird sound data.
Contribution
It introduces and evaluates adversarial training strategies for improving model robustness and generalization in bioacoustic classification under distribution shifts.
Findings
Adversarial training improves clean data accuracy by 10.5%
Output-space attack training enhances robustness against adversarial attacks
Models maintain stability under targeted embedding-space attacks
Abstract
Adversarial training is a promising strategy for enhancing model robustness against adversarial attacks. However, its impact on generalization under substantial data distribution shifts in audio classification remains largely unexplored. To address this gap, this work investigates how different adversarial training strategies improve generalization performance and adversarial robustness in audio classification. The study focuses on two model architectures: a conventional convolutional neural network (ConvNeXt) and an inherently interpretable prototype-based model (AudioProtoPNet). The approach is evaluated using a challenging bird sound classification benchmark. This benchmark is characterized by pronounced distribution shifts between training and test data due to varying environmental conditions and recording methods, a common real-world challenge. The investigation explores two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
