Autonomous In-Situ Soundscape Augmentation via Joint Selection of Masker and Gain
Karn N. Watcharasupat, Kenneth Ooi, Bhan Lam, Trevor Wong, Zhen-Ting, Ong, and Woon-Seng Gan

TL;DR
This paper presents a deep learning system for real-time, in-situ soundscape augmentation that jointly selects maskers and gain levels to enhance acoustic comfort, validated on a large subjective response dataset.
Contribution
It introduces a modular deep learning model for joint masker and gain selection, enabling quick inference and feature-domain augmentation, reducing computational costs and pre-calibration needs.
Findings
Model accurately predicts perceptual pleasantness of soundscapes.
System outperforms traditional static selection methods.
Validated on over 440 participants' responses.
Abstract
The selection of maskers and playback gain levels in a soundscape augmentation system is crucial to its effectiveness in improving the overall acoustic comfort of a given environment. Traditionally, the selection of appropriate maskers and gain levels has been informed by expert opinion, which may not representative of the target population, or by listening tests, which can be time-consuming and labour-intensive. Furthermore, the resulting static choices of masker and gain are often inflexible to the dynamic nature of real-world soundscapes. In this work, we utilized a deep learning model to perform joint selection of the optimal masker and its gain level for a given soundscape. The proposed model was designed with highly modular building blocks, allowing for an optimized inference process that can quickly search through a large number of masker and gain combinations. In addition, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
