Listen2Scene: Interactive material-aware binaural sound propagation for reconstructed 3D scenes
Anton Ratnarajah, Dinesh Manocha

TL;DR
Listen2Scene introduces a neural-network-based binaural sound propagation method for VR/AR that generates plausible acoustic effects for reconstructed 3D indoor environments at interactive rates.
Contribution
It presents a novel graph neural network and CGAN framework for scene-aware acoustic effect generation, handling mesh artifacts and enabling fast, plausible binaural audio rendering.
Findings
Achieves acoustic effect generation in 0.1 ms on GPU.
Outperforms prior geometric and learning-based methods in plausibility.
Validated with perceptual and quantitative evaluations.
Abstract
We present an end-to-end binaural audio rendering approach (Listen2Scene) for virtual reality (VR) and augmented reality (AR) applications. We propose a novel neural-network-based binaural sound propagation method to generate acoustic effects for indoor 3D models of real environments. Any clean audio or dry audio can be convolved with the generated acoustic effects to render audio corresponding to the real environment. We propose a graph neural network that uses both the material and the topology information of the 3D scenes and generates a scene latent vector. Moreover, we use a conditional generative adversarial network (CGAN) to generate acoustic effects from the scene latent vector. Our network can handle holes or other artifacts in the reconstructed 3D mesh model. We present an efficient cost function for the generator network to incorporate spatial audio effects. Given the source…
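The abstract notes that any dry audio can be convolved with the generated acoustic effects to render audio for the scene. A minimal sketch of that rendering step follows, assuming the generated effect is available as a two-channel (left/right) impulse response array. The function name `render_binaural`, the array shapes, and the toy impulse response are illustrative assumptions, not the authors' code; the toy response is decaying noise, not network output.

```python
import numpy as np


def render_binaural(dry, brir):
    """Convolve mono dry audio with a two-channel impulse response.

    dry  : 1-D array of shape (T,), the clean (dry) signal.
    brir : 2-D array of shape (2, N), assumed layout [left, right].
    Returns an array of shape (2, T + N - 1).
    """
    dry = np.asarray(dry, dtype=np.float64)
    brir = np.asarray(brir, dtype=np.float64)
    if dry.ndim != 1 or brir.ndim != 2 or brir.shape[0] != 2:
        raise ValueError("expected dry of shape (T,) and brir of shape (2, N)")
    # Full linear convolution, one ear at a time.
    left = np.convolve(dry, brir[0])
    right = np.convolve(dry, brir[1])
    return np.stack([left, right])


# Toy stand-in for a generated impulse response: exponentially decaying noise.
rng = np.random.default_rng(0)
toy_brir = rng.standard_normal((2, 64)) * np.exp(-np.arange(64) / 16.0)
toy_dry = rng.standard_normal(1000)
wet = render_binaural(toy_dry, toy_brir)
```

For long impulse responses, an FFT-based convolution would typically be faster than direct convolution; the direct form is used here only to keep the sketch dependency-free.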
Taxonomy
Topics: Speech and Audio Processing · Hearing Loss and Rehabilitation · Music and Audio Processing
Methods: Graph Neural Network
