Enhanced Fuzzy Decomposition of Sound Into Sines, Transients, and Noise
Leonardo Fierro, Vesa V\"alim\"aki

TL;DR
This paper introduces an enhanced fuzzy logic-based method for decomposing sounds into sines, transients, and noise, improving transient separation and preserving audio quality for various applications.
Contribution
It presents a novel fuzzy decomposition technique that allows soft classification of spectral components, enhancing transient separation and maintaining perfect reconstruction.
Findings
Better transient separation with minimal energy leakage
Comparable or improved audio quality in subjective tests
Effective integration with time-scale modification methods
Abstract
The decomposition of sounds into sines, transients, and noise is a long-standing research problem in audio processing. The current solutions for this three-way separation detect either horizontal and vertical structures or anisotropy and orientations in the spectrogram to identify the properties of each spectral bin and classify it as sinusoidal, transient, or noise. This paper proposes an enhanced three-way decomposition method based on fuzzy logic, enabling soft masking while preserving the perfect reconstruction property. The proposed method allows each spectral bin to simultaneously belong to two classes, sine and noise or transient and noise. Results of a subjective listening test against three other techniques are reported, showing that the proposed decomposition yields a better or comparable quality. The main improvement appears in transient separation, which enjoys little or no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Blind Source Separation Techniques · Image and Signal Denoising Methods
MethodsTest
