Improvements of Discriminative Feature Space Training for Anomalous   Sound Detection in Unlabeled Conditions

Takuya Fujimura; Ibuki Kuroyanagi; and Tomoki Toda

arXiv:2409.09332·eess.AS·September 17, 2024

Improvements of Discriminative Feature Space Training for Anomalous Sound Detection in Unlabeled Conditions

Takuya Fujimura, Ibuki Kuroyanagi, and Tomoki Toda

PDF

Open Access 1 Repo

TL;DR

This paper enhances discriminative feature space training for anomalous sound detection in unlabeled conditions by improving feature extraction and introducing pseudo-labeling methods, leading to better performance when labels are missing.

Contribution

The paper introduces a novel multi-resolution spectrogram-based feature extractor and various pseudo-labeling techniques to improve anomalous sound detection without labeled data.

Findings

01

Significant performance improvements with the proposed methods.

02

Effective feature extractor under unlabeled conditions.

03

Pseudo-labeling enhances training accuracy.

Abstract

In anomalous sound detection, the discriminative method has demonstrated superior performance. This approach constructs a discriminative feature space through the classification of the meta-information labels for normal sounds. This feature space reflects the differences in machine sounds and effectively captures anomalous sounds. However, its performance significantly degrades when the meta-information labels are missing. In this paper, we improve the performance of a discriminative method under unlabeled conditions by two approaches. First, we enhance the feature extractor to perform better under unlabeled conditions. Our enhanced feature extractor utilizes multi-resolution spectrograms with a new training strategy. Second, we propose various pseudo-labeling methods to effectively train the feature extractor. The experimental evaluations show that the proposed feature extractor and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TakuyaFujimura/unlabeledasd
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Music and Audio Processing · Speech Recognition and Synthesis