Timbre Difference Capturing in Anomalous Sound Detection
Tomoya Nishida, Harsh Purohit, Kota Dohi, Takashi Endo, Yohei, Kawaguchi

TL;DR
This paper introduces a novel framework for explaining anomalous machine sounds by quantifying timbre attribute differences without requiring anomalous sound training data, aiding machine condition monitoring.
Contribution
It proposes a timbre difference explanation method that does not need anomalous sounds for training, using psycho-acoustical models and a k-NN based approach for accurate detection and explanation.
Findings
Effective timbre difference estimation without anomalous training data
Improved anomalous sound detection accuracy on MIMII DG dataset
Framework based on psycho-acoustical models and k-NN method
Abstract
This paper proposes a framework of explaining anomalous machine sounds in the context of anomalous sound detection~(ASD). While ASD has been extensively explored, identifying how anomalous sounds differ from normal sounds is also beneficial for machine condition monitoring. However, existing sound difference captioning methods require anomalous sounds for training, which is impractical in typical machine condition monitoring settings where such sounds are unavailable. To solve this issue, we propose a new strategy for explaining anomalous differences that does not require anomalous sounds for training. Specifically, we introduce a framework that explains differences in predefined timbre attributes instead of using free-form text captions. Objective metrics of timbre attributes can be computed using timbral models developed through psycho-acoustical research, enabling the estimation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
