SSDPT: Self-Supervised Dual-Path Transformer for Anomalous Sound Detection in Machine Condition Monitoring
Jisheng Bai, Jianfeng Chen, Mou Wang, Muhammad Saad Ayub, and Qingli, Yan

TL;DR
This paper introduces SSDPT, a self-supervised dual-path Transformer model that effectively detects anomalous machine sounds by learning from normal sounds, utilizing attention mechanisms for improved acoustic feature modeling.
Contribution
The paper presents a novel self-supervised dual-path Transformer architecture with attention modules for time and frequency modeling in anomalous sound detection.
Findings
Significant improvement in harmonic mean AUC score over state-of-the-art methods
Effective modeling of acoustic features using attention-based DPT blocks
Successful application of self-supervised learning with feature masking and reconstruction
Abstract
Anomalous sound detection for machine condition monitoring has great potential in the development of Industry 4.0. However, these anomalous sounds of machines are usually unavailable in normal conditions. Therefore, the models employed have to learn acoustic representations with normal sounds for training, and detect anomalous sounds while testing. In this article, we propose a self-supervised dual-path Transformer (SSDPT) network to detect anomalous sounds in machine monitoring. The SSDPT network splits the acoustic features into segments and employs several DPT blocks for time and frequency modeling. DPT blocks use attention modules to alternately model the interactive information about the frequency and temporal components of the segmented acoustic features. To address the problem of lack of anomalous sound, we adopt a self-supervised learning approach to train the network with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Anomaly Detection Techniques and Applications · Phonocardiography and Auscultation Techniques
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Softmax · Adam · Absolute Position Encodings · Label Smoothing · Position-Wise Feed-Forward Layer · Convolution
