Event Classification by Physics-informed Inpainting for Distributed Multichannel Acoustic Sensor with Partially Degraded Channels

Noriyuki Tonami; Wataru Kohno; Yoshiyuki Yajima; Sakiko Mishima; Yumi Arai; Reishi Kondo; Tomoyuki Hino

arXiv:2601.13513·cs.SD·January 21, 2026

Event Classification by Physics-informed Inpainting for Distributed Multichannel Acoustic Sensor with Partially Degraded Channels

Noriyuki Tonami, Wataru Kohno, Yoshiyuki Yajima, Sakiko Mishima, Yumi Arai, Reishi Kondo, Tomoyuki Hino

PDF

Open Access

TL;DR

This paper introduces a physics-informed inpainting method using reverse time migration for distributed multichannel acoustic sensors, significantly improving sound event classification accuracy in degraded and layout-mismatched sensor setups.

Contribution

It presents a novel, learning-free, physics-based preprocessing technique that enhances multichannel acoustic sensing performance under challenging conditions.

Findings

01

Achieves up to 13.1 percentage point accuracy improvement on challenging sensor layouts.

02

Outperforms baseline methods in accuracy across various sensor configurations.

03

Correlation analysis shows spatial weights align more with SNR than with channel-source distance.

Abstract

Distributed multichannel acoustic sensing (DMAS) enables large-scale sound event classification (SEC), but performance drops when many channels are degraded and when sensor layouts at test time differ from training layouts. We propose a learning-free, physics-informed inpainting frontend based on reverse time migration (RTM). In this approach, observed multichannel spectrograms are first back-propagated on a 3D grid using an analytic Green's function to form a scene-consistent image, and then forward-projected to reconstruct inpainted signals before log-mel feature extraction and Transformer-based classification. We evaluate the method on ESC-50 with 50 sensors and three layouts (circular, linear, right-angle), where per-channel SNRs are sampled from -30 to 0 dB. Compared with an AST baseline, scaling-sparsemax channel selection, and channel-swap augmentation, the proposed RTM frontend…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Neural Networks and Reservoir Computing · Phonocardiography and Auscultation Techniques