Perceptual Noise-Masking with Music through Deep Spectral Envelope   Shaping

Cl\'ementine Berger (IP Paris; IDS; S2A); Roland Badeau (IP Paris,; IDS; S2A); Slim Essid (IP Paris; IDS; S2A)

arXiv:2502.17527·cs.SD·February 26, 2025

Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping

Cl\'ementine Berger (IP Paris, IDS, S2A), Roland Badeau (IP Paris,, IDS, S2A), Slim Essid (IP Paris, IDS, S2A)

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neural network that reshapes music's spectral envelope to improve noise masking in noisy environments, enhancing listening experience while preserving audio quality.

Contribution

A novel psychoacoustic masking model using deep spectral envelope shaping to enhance noise masking in music through neural network training.

Findings

01

Improved noise masking performance over existing methods.

02

Effective preservation of original music quality.

03

Validated on simulated noisy listening scenarios.

Abstract

People often listen to music in noisy environments, seeking to isolate themselves from ambient sounds. Indeed, a music signal can mask some of the noise's frequency components due to the effect of simultaneous masking. In this article, we propose a neural network based on a psychoacoustic masking model, designed to enhance the music's ability to mask ambient noise by reshaping its spectral envelope with predicted filter frequency responses. The model is trained with a perceptual loss function that balances two constraints: effectively masking the noise while preserving the original music mix and the user's chosen listening level. We evaluate our approach on simulated data replicating a user's experience of listening to music with headphones in a noisy environment. The results, based on defined objective metrics, demonstrate that our system improves the state of the art.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ClementineBerger/DPNMM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods · Speech and Audio Processing · Music and Audio Processing