Binaspect -- A Python Library for Binaural Audio Analysis, Visualization & Feature Generation
Dan Barry, Davoud Shariat Panah, Alessandro Ragano, Jan Skoglund, Andrew Hines

TL;DR
Binaspect is an open-source Python library that analyzes, visualizes, and generates features from binaural audio, enabling detailed inspection of spatial cues and degradations without prior head model knowledge.
Contribution
The paper introduces Binaspect, a novel tool for blind binaural audio analysis that produces interpretable azimuth maps and features for downstream tasks.
Findings
Effectively visualizes binaural cue degradations.
Distinguishes multiple active sources as distinct clusters.
Reveals effects of codecs and rendering methods on spatial cues.
Abstract
We present Binaspect, an open-source Python library for binaural audio analysis, visualization, and feature generation. Binaspect generates interpretable "azimuth maps" by calculating modified interaural time and level difference spectrograms, and clustering those time-frequency (TF) bins into stable time-azimuth histogram representations. This allows multiple active sources to appear as distinct azimuthal clusters, while degradations manifest as broadened, diffused, or shifted distributions. Crucially, Binaspect operates blindly on audio, requiring no prior knowledge of head models. These visualizations enable researchers and engineers to observe how binaural cues are degraded by codec and renderer design choices, among other downstream processes. We demonstrate the tool on bitrate ladders, ambisonic rendering, and VBAP source positioning, where degradations are clearly revealed. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
