Disentangling Hierarchical Features for Anomalous Sound Detection Under Domain Shift
Jian Guan, Jiantong Tian, Qiaoxi Zhu, Feiyang Xiao, Hejing Zhang, Xubo, Liu

TL;DR
This paper introduces a Gradient Reversal-based Hierarchical feature Disentanglement (GRHD) method that improves anomalous sound detection under domain shift by separating domain-related and unrelated features using hierarchical structures.
Contribution
The paper proposes a novel GRHD method that disentangles hierarchical features with gradient reversal, enhancing robustness in ASD under domain shift conditions.
Findings
Significant performance improvement on DCASE 2022 dataset.
Effective separation of domain-related and unrelated features.
Robustness against domain shift in anomalous sound detection.
Abstract
Anomalous sound detection (ASD) encounters difficulties with domain shift, where the sounds of machines in target domains differ significantly from those in source domains due to varying operating conditions. Existing methods typically employ domain classifiers to enhance detection performance, but they often overlook the influence of domain-unrelated information. This oversight can hinder the model's ability to clearly distinguish between domains, thereby weakening its capacity to differentiate normal from abnormal sounds. In this paper, we propose a Gradient Reversal-based Hierarchical feature Disentanglement (GRHD) method to address the above challenge. GRHD uses gradient reversal to separate domain-related features from domain-unrelated ones, resulting in more robust feature representations. Additionally, the method employs a hierarchical structure to guide the learning of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Anomaly Detection Techniques and Applications · Speech and Audio Processing
