Explainable anomaly detection for sound spectrograms using pooling statistics with quantile differences
Nicolas Thewes, Philipp Steinhauer, Patrick Trampert, Markus Pauly, Georg Schneider

TL;DR
This paper introduces an explainable anomaly detection method for sound spectrograms that uses pooling statistics and quantile differences, suitable for industrial applications requiring transparent AI solutions.
Contribution
The paper presents a novel, statistically motivated anomaly detection approach for spectrograms that offers intrinsic explainability, addressing industrial needs for transparent AI.
Findings
Effective detection of anomalies in sound spectrograms.
Provides interpretable results suitable for industrial use.
Outperforms some existing methods in accuracy and explainability.
Abstract
Anomaly detection is the task of identifying rarely occurring (i.e. anormal or anomalous) samples that differ from almost all other samples in a dataset. As the patterns of anormal samples are usually not known a priori, this task is highly challenging. Consequently, anomaly detection lies between semi- and unsupervised learning. The detection of anomalies in sound data, often called 'ASD' (Anomalous Sound Detection), is a sub-field that deals with the identification of new and yet unknown effects in acoustic recordings. It is of great importance for various applications in Industry 4.0. Here, vibrational or acoustic data are typically obtained from standard sensor signals used for predictive maintenance. Examples cover machine condition monitoring or quality assurance to track the state of components or products. However, the use of intelligent algorithms remains a controversial topic.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Time Series Analysis and Forecasting · Music and Audio Processing
