Deep Room Recognition Using Inaudible Echos
Qun Song, Chaojie Gu, Rui Tan

TL;DR
This paper introduces a privacy-preserving, deep learning-based method for room recognition using inaudible echoes captured in 0.1 seconds, achieving high accuracy across various indoor environments without additional hardware.
Contribution
It presents a novel deep learning approach utilizing short, narrowband inaudible audio signals for accurate, privacy-preserving room recognition, and develops a cloud service for easy integration.
Findings
Achieves up to 99.7% accuracy in room differentiation
Outperforms state-of-the-art SVM-based methods in accuracy and robustness
Requires only 0.1 seconds of inaudible audio for recognition
Abstract
Recent years have seen the increasing need of location awareness by mobile applications. This paper presents a room-level indoor localization approach based on the measured room's echos in response to a two-millisecond single-tone inaudible chirp emitted by a smartphone's loudspeaker. Different from other acoustics-based room recognition systems that record full-spectrum audio for up to ten seconds, our approach records audio in a narrow inaudible band for 0.1 seconds only to preserve the user's privacy. However, the short-time and narrowband audio signal carries limited information about the room's characteristics, presenting challenges to accurate room recognition. This paper applies deep learning to effectively capture the subtle fingerprints in the rooms' acoustic responses. Our extensive experiments show that a two-layer convolutional neural network fed with the spectrogram of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndoor and Outdoor Localization Technologies · Speech and Audio Processing · Music and Audio Processing
