Diversity-Robust Acoustic Feature Signatures Based on Multiscale Fractal Dimension for Similarity Search of Environmental Sounds
Motohiro Sunouchi, Masaharu Yoshioka

TL;DR
This paper introduces multiscale fractal dimension-based acoustic signatures, including MFD-VL, that are robust for environmental sound similarity search, improving performance and stability against noise and source fluctuations.
Contribution
It extends previous fractal dimension signatures with kernel density estimation and proposes a new long-range MFD-VL signature for environmental sound analysis.
Findings
MFD-VL improves long-term feature stability.
Signatures outperform traditional features in noisy environments.
Combined features enhance similarity search accuracy.
Abstract
This paper proposes new acoustic feature signatures based on the multiscale fractal dimension (MFD), which are robust against the diversity of environmental sounds, for the content-based similarity search. The diversity of sound sources and acoustic compositions is a typical feature of environmental sounds. Several acoustic features have been proposed for environmental sounds. Among them is the widely-used Mel-Frequency Cepstral Coefficients (MFCCs), which describes frequency-domain features. However, in addition to these features in the frequency domain, environmental sounds have other important features in the time domain with various time scales. In our previous paper, we proposed enhanced multiscale fractal dimension signature (EMFD) for environmental sounds. This paper extends EMFD by using the kernel density estimation method, which results in better performance of the similarity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
