If It's Good Enough for You, It's Good Enough for Me: Transferability of Audio Sufficiencies across Models
David A. Kelly, Hana Chockler

TL;DR
This paper introduces transferability analysis to study how minimal, sufficient audio signals for classification transfer across different models, revealing task-dependent transferability rates and model-specific behaviors.
Contribution
It defines transferability of sufficient signals and conducts a large study across three audio classification tasks, uncovering task-specific transferability patterns and model differences.
Findings
Music genre transferability rate is approximately 26%.
Deepfake detection models show high variance in transferability.
Transferability analysis uncovers information-theoretic differences beyond accuracy.
Abstract
In order to gain fresh insights about the information processing characteristics of different audio classification models, we propose transferability analysis. Given a minimal, sufficient signal for a classification on a model , transferability analysis asks whether other models accept this minimal signal as having the same classification as it did on . We define what it means for a sufficient signal to be transferable and perform a large study over different classification tasks: music genre, emotion recognition and deepfake detection. We find that transferability rates vary depending on the task, with sufficient signals for music genre being transferable of the time. The other tasks reveal much higher variance in transferability and reveal that some models, in particular on deepfake detection, have different transferability behavior. We call these models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
