UltrasonicSpheres: Localized, Multi-Channel Sound Spheres Using Off-the-Shelf Speakers and Earables
Michael Küttner, Valeria Zitz, Kathrin Gerling, Michael Beigl, Tobias Röddiger

TL;DR
UltrasonicSpheres introduces a system that uses ultrasonic signals and off-the-shelf speakers to deliver localized, multi-channel audio to wearable earphones, enabling personalized sound experiences without extra infrastructure.
Contribution
The paper presents a novel ultrasonic audio delivery system that achieves localized, multi-channel sound using single off-the-shelf ultrasonic speakers and acoustically transparent earphones, without pairing, tracking, or additional infrastructure.
Findings
Localized multi-channel audio achieved with ultrasonic signals
Users can demodulate personalized streams while remaining aware of ambient sounds
System preserves spatial audio perception and is inaudible to others
Abstract
We present a demo of UltrasonicSpheres, a novel system for location-specific audio delivery using wearable earphones that decode ultrasonic signals into audible sound. Unlike conventional beamforming setups, UltrasonicSpheres relies on single ultrasonic speakers to broadcast localized audio with multiple channels, each encoded on a distinct ultrasonic carrier frequency. Users wearing our acoustically transparent earphones can demodulate their selected stream, such as exhibit narrations in a chosen language, while remaining fully aware of ambient environmental sounds. The experience preserves spatial audio perception, giving the impression that the sound originates directly from the physical location of the source. This enables personalized, localized audio without requiring pairing, tracking, or additional infrastructure. Importantly, visitors not equipped with the earphones are…
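The idea of several channels sharing one speaker, each on its own ultrasonic carrier, can be illustrated with a small simulation. The sketch below is not the authors' implementation: the abstract does not state the modulation scheme, so plain amplitude modulation with coherent (product) demodulation is assumed here, and the sample rate, carrier frequencies, channel names, and test tones are all hypothetical values chosen for illustration.

```python
import math

# All values below are illustrative assumptions, not taken from the paper.
FS = 192_000                                   # sample rate in Hz
CARRIERS = {"en": 30_000.0, "de": 40_000.0}    # one ultrasonic carrier per channel
TONES = {"en": 500.0, "de": 800.0}             # stand-ins for two narration streams
N_SAMPLES = 3840                               # 20 ms of signal
WIDTH = 96                                     # moving-average length (nulls at multiples of 2 kHz)


def tone(freq, n):
    """Return n samples of a unit-amplitude sine at freq Hz."""
    return [math.sin(2 * math.pi * freq * i / FS) for i in range(n)]


def broadcast(messages, depth=0.5):
    """Amplitude-modulate each message onto its carrier and sum the results."""
    n = len(next(iter(messages.values())))
    out = [0.0] * n
    for name, msg in messages.items():
        fc = CARRIERS[name]
        for i in range(n):
            out[i] += (1.0 + depth * msg[i]) * math.cos(2 * math.pi * fc * i / FS)
    return out


def moving_average(x, width):
    """Simple low-pass filter; returns only fully averaged samples."""
    running = sum(x[:width])
    out = [running / width]
    for i in range(width, len(x)):
        running += x[i] - x[i - width]
        out.append(running / width)
    return out


def demodulate(signal, fc, width=WIDTH):
    """Mix with the selected carrier, low-pass filter, and remove the DC offset."""
    mixed = [2.0 * s * math.cos(2 * math.pi * fc * i / FS) for i, s in enumerate(signal)]
    base = moving_average(mixed, width)
    mean = sum(base) / len(base)
    return [b - mean for b in base]


def tone_level(x, freq):
    """Estimate the amplitude of the freq Hz component of x (phase-insensitive)."""
    c = sum(v * math.cos(2 * math.pi * freq * i / FS) for i, v in enumerate(x))
    s = sum(v * math.sin(2 * math.pi * freq * i / FS) for i, v in enumerate(x))
    return 2.0 * math.hypot(c, s) / len(x)


def demo():
    """Broadcast both channels, then report tone levels after selecting each one."""
    rx = broadcast({name: tone(f, N_SAMPLES) for name, f in TONES.items()})
    report = {}
    for name, fc in CARRIERS.items():
        audio = demodulate(rx, fc)
        report[name] = {other: tone_level(audio, f) for other, f in TONES.items()}
    return report


if __name__ == "__main__":
    for selected, levels in demo().items():
        print(selected, levels)
```

Under these assumptions, selecting a carrier recovers mostly that channel's tone, while the other channel lands at and around the 10 kHz difference frequency and is suppressed by the low-pass filter. A real earphone would face effects this sketch ignores, such as propagation loss, carrier phase mismatch, and microphone bandwidth.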
Taxonomy
Topics: Speech and Audio Processing
