Adversarial Training Improves Generalization Under Distribution Shifts in Bioacoustics

Ren\'e Heinrich; Lukas Rauch; Bernhard Sick; Christoph Scholz

arXiv:2507.13727·cs.LG·July 21, 2025

Adversarial Training Improves Generalization Under Distribution Shifts in Bioacoustics

Ren\'e Heinrich, Lukas Rauch, Bernhard Sick, Christoph Scholz

PDF

TL;DR

This paper demonstrates that adversarial training, especially with output-space attacks, enhances both generalization and robustness of audio classification models under real-world distribution shifts, using bird sound data.

Contribution

It introduces and evaluates adversarial training strategies for improving model robustness and generalization in bioacoustic classification under distribution shifts.

Findings

01

Adversarial training improves clean data accuracy by 10.5%

02

Output-space attack training enhances robustness against adversarial attacks

03

Models maintain stability under targeted embedding-space attacks

Abstract

Adversarial training is a promising strategy for enhancing model robustness against adversarial attacks. However, its impact on generalization under substantial data distribution shifts in audio classification remains largely unexplored. To address this gap, this work investigates how different adversarial training strategies improve generalization performance and adversarial robustness in audio classification. The study focuses on two model architectures: a conventional convolutional neural network (ConvNeXt) and an inherently interpretable prototype-based model (AudioProtoPNet). The approach is evaluated using a challenging bird sound classification benchmark. This benchmark is characterized by pronounced distribution shifts between training and test data due to varying environmental conditions and recording methods, a common real-world challenge. The investigation explores two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.