Disentangling Hierarchical Features for Anomalous Sound Detection Under   Domain Shift

Jian Guan; Jiantong Tian; Qiaoxi Zhu; Feiyang Xiao; Hejing Zhang; Xubo; Liu

arXiv:2501.01604·cs.SD·January 6, 2025

Disentangling Hierarchical Features for Anomalous Sound Detection Under Domain Shift

Jian Guan, Jiantong Tian, Qiaoxi Zhu, Feiyang Xiao, Hejing Zhang, Xubo, Liu

PDF

Open Access

TL;DR

This paper introduces a Gradient Reversal-based Hierarchical feature Disentanglement (GRHD) method that improves anomalous sound detection under domain shift by separating domain-related and unrelated features using hierarchical structures.

Contribution

The paper proposes a novel GRHD method that disentangles hierarchical features with gradient reversal, enhancing robustness in ASD under domain shift conditions.

Findings

01

Significant performance improvement on DCASE 2022 dataset.

02

Effective separation of domain-related and unrelated features.

03

Robustness against domain shift in anomalous sound detection.

Abstract

Anomalous sound detection (ASD) encounters difficulties with domain shift, where the sounds of machines in target domains differ significantly from those in source domains due to varying operating conditions. Existing methods typically employ domain classifiers to enhance detection performance, but they often overlook the influence of domain-unrelated information. This oversight can hinder the model's ability to clearly distinguish between domains, thereby weakening its capacity to differentiate normal from abnormal sounds. In this paper, we propose a Gradient Reversal-based Hierarchical feature Disentanglement (GRHD) method to address the above challenge. GRHD uses gradient reversal to separate domain-related features from domain-unrelated ones, resulting in more robust feature representations. Additionally, the method employs a hierarchical structure to guide the learning of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Anomaly Detection Techniques and Applications · Speech and Audio Processing