RIR-Former: Coordinate-Guided Transformer for Continuous Reconstruction of Room Impulse Responses

Shaoheng Xu; Chunyi Sun; Jihui Zhang; Prasanga N. Samarasinghe; Thushara D. Abhayapala

arXiv:2602.01861·eess.AS·May 12, 2026

RIR-Former: Coordinate-Guided Transformer for Continuous Reconstruction of Room Impulse Responses

Shaoheng Xu, Chunyi Sun, Jihui Zhang, Prasanga N. Samarasinghe, Thushara D. Abhayapala

PDF

TL;DR

RIR-Former is a novel transformer-based model that reconstructs room impulse responses across space without grid constraints, effectively handling early and late reverberations for diverse acoustic environments.

Contribution

It introduces a sinusoidal encoding with a segmented multi-branch decoder into a transformer, enabling accurate, grid-free RIR interpolation at arbitrary locations.

Findings

01

Outperforms state-of-the-art methods in NMSE and cosine distance.

02

Works effectively with varying missing data rates and array configurations.

03

Demonstrates potential for practical deployment in diverse acoustic scenarios.

Abstract

Room impulse responses (RIRs) are essential for many acoustic signal processing tasks, yet measuring them densely across space is often impractical. In this work, we propose RIR-Former, a grid-free, one-step feed-forward model for RIR reconstruction. By introducing a sinusoidal encoding module into a transformer backbone, our method effectively incorporates microphone position information, enabling interpolation at arbitrary array locations. Furthermore, a segmented multi-branch decoder is designed to separately handle early reflections and late reverberation, improving reconstruction across the entire RIR. Experiments on diverse simulated acoustic environments demonstrate that RIR-Former consistently outperforms state-of-the-art baselines in terms of normalized mean square error (NMSE) and cosine distance (CD), under varying missing rates and array configurations. These results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.