An Improved Measure of Musical Noise Based on Spectral Kurtosis
Matteo Torcoli

TL;DR
This paper introduces a new, computationally efficient measure for musical noise that better correlates with human perception in audio processing tasks like coding and source separation.
Contribution
It proposes an improved spectral kurtosis-based measure that outperforms existing baselines and approaches the accuracy of the computationally expensive APS.
Findings
New measure shows higher correlation with human perception.
Performance nearly matches the best existing measure (APS).
Proposed method is computationally efficient.
Abstract
Audio processing methods operating on a time-frequency representation of the signal can introduce unpleasant sounding artifacts known as musical noise. These artifacts are observed in the context of audio coding, speech enhancement, and source separation. The change in kurtosis of the power spectrum introduced during the processing was shown to correlate with the human perception of musical noise in the context of speech enhancement, leading to the proposal of measures based on it. These baseline measures are here shown to correlate with human perception only in a limited manner. As ground truth for the human perception, the results from two listening tests are considered: one involving audio coding and one involving source separation. Simple but effective perceptually motivated improvements are proposed and the resulting new measure is shown to clearly outperform the baselines in terms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
