TT-Net: Dual-path transformer based sound field translation in the spherical harmonic domain
Yiwen Wang, Zijian Lan, Xihong Wu, Tianshu Qu

TL;DR
This paper introduces TT-Net, a dual-path transformer neural network that improves sound field translation accuracy in the spherical harmonic domain by addressing stability issues and enabling better handling of multiple sampling points.
Contribution
The paper proposes a novel dual-path transformer architecture for sound field translation that enhances stability and accuracy over traditional methods based on spherical harmonic analysis.
Findings
Extended frequency and distance range for sound field translation.
Achieved more accurate higher-order spherical harmonic coefficients.
Improved stability and robustness of translation matrices.
Abstract
In the current method for the sound field translation tasks based on spherical harmonic (SH) analysis, the solution based on the additive theorem usually faces the problem of singular values caused by large matrix condition numbers. The influence of different distances and frequencies of the spherical radial function on the stability of the translation matrix will affect the accuracy of the SH coefficients at the selected point. Due to the problems mentioned above, we propose a neural network scheme based on the dual-path transformer. More specifically, the dual-path network is constructed by the self-attention module along the two dimensions of the frequency and order axes. The transform-average-concatenate layer and upscaling layer are introduced in the network, which provides solutions for multiple sampling points and upscaling. Numerical simulation results indicate that both the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAcoustic Wave Phenomena Research · Speech and Audio Processing · Advanced Adaptive Filtering Techniques
