An Enhanced Audio Feature Tailored for Anomalous Sound Detection Based on Pre-trained Models

Guirui Zhong; Qing Wang; Jun Du; Lei Wang; Mingqi Cai; and Xin Fang

arXiv:2508.15334·cs.SD·August 22, 2025

An Enhanced Audio Feature Tailored for Anomalous Sound Detection Based on Pre-trained Models

Guirui Zhong, Qing Wang, Jun Du, Lei Wang, Mingqi Cai, and Xin Fang

PDF

Open Access

TL;DR

This paper introduces a novel audio feature tailored for anomalous sound detection that leverages pre-trained models and a filter bank design to improve detection accuracy by focusing on all frequency ranges and removing redundant noise.

Contribution

It proposes a new filter bank-based audio feature and a parameter-free enhancement method utilizing pre-trained models, advancing ASD performance and transfer learning capabilities.

Findings

01

Significant performance improvements on DCASE 2024 dataset

02

Effective removal of redundant noise in machine sounds

03

Enhanced detection of anomalies across all frequency ranges

Abstract

Anomalous Sound Detection (ASD) aims at identifying anomalous sounds from machines and has gained extensive research interests from both academia and industry. However, the uncertainty of anomaly location and much redundant information such as noise in machine sounds hinder the improvement of ASD system performance. This paper proposes a novel audio feature of filter banks with evenly distributed intervals, ensuring equal attention to all frequency ranges in the audio, which enhances the detection of anomalies in machine sounds. Moreover, based on pre-trained models, this paper presents a parameter-free feature enhancement approach to remove redundant information in machine audio. It is believed that this parameter-free strategy facilitates the effective transfer of universal knowledge from pre-trained tasks to the ASD task during model fine-tuning. Evaluation results on the Detection…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Music and Audio Processing · Time Series Analysis and Forecasting