BatVision: Learning to See 3D Spatial Layout with Two Ears
Jesper Haahr Christensen, Sascha Hornauer, Stella Yu

TL;DR
This paper introduces BatVision, a low-cost system inspired by bat echolocation that uses sound to perceive 3D spatial layouts, enabling vision-like capabilities in low-light conditions.
Contribution
We develop a novel sound-based perception system that predicts visual scene depth and grayscale images from audio signals, inspired by biological echolocation.
Findings
Accurately predicts depth maps from sound alone
Provides good visual scene predictions from audio signals
Operates effectively in low-light or no-light conditions
Abstract
Many species have evolved advanced non-visual perception while artificial systems fall behind. Radar and ultrasound complement camera-based vision but they are often too costly and complex to set up for very limited information gain. In nature, sound is used effectively by bats, dolphins, whales, and humans for navigation and communication. However, it is unclear how to best harness sound for machine perception. Inspired by bats' echolocation mechanism, we design a low-cost BatVision system that is capable of seeing the 3D spatial layout of space ahead by just listening with two ears. Our system emits short chirps from a speaker and records returning echoes through microphones in an artificial human pinnae pair. During training, we additionally use a stereo camera to capture color images for calculating scene depths. We train a model to predict depth maps and even grayscale images from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBat Biology and Ecology Studies · Image Processing Techniques and Applications · Video Surveillance and Tracking Methods
MethodsConcatenated Skip Connection · Max Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · U-Net
