Spatial Audio Signal Enhancement: A Multi-output MVDR Method in The Spherical Harmonic-domain

Huawei Zhang; Jihui Zhang; Huiyuan Sun; Prasanga Samarasinghe

arXiv:2409.03269·eess.AS·September 3, 2025·APSIPA

Spatial Audio Signal Enhancement: A Multi-output MVDR Method in The Spherical Harmonic-domain

Huawei Zhang, Jihui Zhang, Huiyuan Sun, Prasanga Samarasinghe

PDF

Open Access

TL;DR

This paper introduces a novel spherical harmonic domain MVDR method for spatial audio enhancement that effectively reduces interference and preserves spatial cues in reverberant environments, outperforming existing approaches in simulation.

Contribution

The paper proposes a new multi-output MVDR approach using Relative Harmonic Coefficients for improved spatial audio signal enhancement in reverberant conditions.

Findings

01

Lower estimation error compared to baseline

02

Higher speech-distortion-ratio (SDR) achieved

03

Comparable noise reduction within the sweet area

Abstract

Spatial audio signal enhancement aims to reduce interfering source contributions while preserving the desired sound field with its spatial cues. Existing methods generally rely on impractical assumptions (e.g. accurate estimations of impractical information) or have limited applicability. This paper presents a spherical harmonic (SH)-domain minimum variance distortionless response (MVDR)-based spatial signal enhancer using Relative Harmonic Coefficients (ReHCs) to extract clean SH coefficients from noisy recordings in reverberant environments. A simulation study shows the proposed method achieves lower estimation error, higher speech-distortion-ratio (SDR), and comparable noise reduction (NR) within the sweet area in a reverberant environment, compared to a beamforming-and-projection method as the baseline.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Vehicle Noise and Vibration Control