Machine Anomalous Sound Detection Using Spectral-temporal Modulation Representations Derived from Machine-specific Filterbanks
Kai Li, Khalid Zaman, Xingfeng Li, Masato Akagi, and Masashi Unoki

TL;DR
This paper introduces a novel machine-specific spectral-temporal modulation approach using non-uniform filterbanks and autoencoder neural networks to improve anomalous sound detection in industrial machinery, especially under noisy conditions.
Contribution
It proposes a new spectral-temporal modulation representation derived from machine-specific filterbanks tailored by Fisher ratio analysis, enhancing ASD performance.
Findings
NUFBs improve AUC performance in noisy environments.
Spectral and temporal modulations are effective for different machine types.
Method outperforms baseline ASD techniques.
Abstract
Early detection of factory machinery malfunctions is crucial in industrial applications. In machine anomalous sound detection (ASD), different machines exhibit unique vibration-frequency ranges based on their physical properties. Meanwhile, the human auditory system is adept at tracking both temporal and spectral dynamics of machine sounds. Consequently, integrating the computational auditory models of the human auditory system with machine-specific properties can be an effective approach to machine ASD. We first quantified the frequency importances of four types of machines using the Fisher ratio (F-ratio). The quantified frequency importances were then used to design machine-specific non-uniform filterbanks (NUFBs), which extract the log non-uniform spectrum (LNS) feature. The designed NUFBs have a narrower bandwidth and higher filter distribution density in frequency regions with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Anomaly Detection Techniques and Applications
MethodsSparse Evolutionary Training
