Improvements of Discriminative Feature Space Training for Anomalous Sound Detection in Unlabeled Conditions
Takuya Fujimura, Ibuki Kuroyanagi, and Tomoki Toda

TL;DR
This paper enhances discriminative feature space training for anomalous sound detection in unlabeled conditions by improving feature extraction and introducing pseudo-labeling methods, leading to better performance when labels are missing.
Contribution
The paper introduces a novel multi-resolution spectrogram-based feature extractor and various pseudo-labeling techniques to improve anomalous sound detection without labeled data.
Findings
Significant performance improvements with the proposed methods.
Effective feature extractor under unlabeled conditions.
Pseudo-labeling enhances training accuracy.
Abstract
In anomalous sound detection, the discriminative method has demonstrated superior performance. This approach constructs a discriminative feature space through the classification of the meta-information labels for normal sounds. This feature space reflects the differences in machine sounds and effectively captures anomalous sounds. However, its performance significantly degrades when the meta-information labels are missing. In this paper, we improve the performance of a discriminative method under unlabeled conditions by two approaches. First, we enhance the feature extractor to perform better under unlabeled conditions. Our enhanced feature extractor utilizes multi-resolution spectrograms with a new training strategy. Second, we propose various pseudo-labeling methods to effectively train the feature extractor. The experimental evaluations show that the proposed feature extractor and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Music and Audio Processing · Speech Recognition and Synthesis
