Domain Shift-oriented Machine Anomalous Sound Detection Model Based on Self-Supervised Learning
Jing-ke Yan, Xin Wang, Qin Wang, Qin Qin, Huang-he Li, Peng-fei Ye,, Yue-ping He, Jing Zeng

TL;DR
This paper introduces a novel self-supervised learning model, TranSelf-DyGCN, that effectively detects anomalous machine sounds under domain shifts by modeling spatial-temporal features, inter-dependencies, and domain adaptation.
Contribution
The paper proposes a domain shift-oriented detection model combining a time-frequency feature network, dynamic graph convolution, and domain adaptation, improving stability and adaptability in unsupervised settings.
Findings
Enhanced detection stability under domain shifts
Effective modeling of inter-dependencies between features
Improved performance on DCASE datasets
Abstract
Thanks to the development of deep learning, research on machine anomalous sound detection based on self-supervised learning has made remarkable achievements. However, there are differences in the acoustic characteristics of the test set and the training set under different operating conditions of the same machine (domain shifts). It is challenging for the existing detection methods to learn the domain shifts features stably with low computation overhead. To address these problems, we propose a domain shift-oriented machine anomalous sound detection model based on self-supervised learning (TranSelf-DyGCN) in this paper. Firstly, we design a time-frequency domain feature modeling network to capture global and local spatial and time-domain features, thus improving the stability of machine anomalous sound detection stability under domain shifts. Then, we adopt a Dynamic Graph Convolutional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing
