UltrasonicSpheres: Localized, Multi-Channel Sound Spheres Using Off-the-Shelf Speakers and Earables

Michael Küttner; Valeria Zitz; Kathrin Gerling; Michael Beigl; Tobias Röddiger

arXiv:2506.02715·cs.SD·July 8, 2025

TL;DR

UltrasonicSpheres introduces a system that uses ultrasonic signals and off-the-shelf speakers to deliver localized, multi-channel audio to wearable earphones, enabling personalized sound experiences without extra infrastructure.

Contribution

The paper presents an ultrasonic audio delivery system that broadcasts localized, multi-channel sound from single ultrasonic speakers to acoustically transparent earphones, without requiring pairing, tracking, or additional infrastructure.

Findings

01

Localized multi-channel audio achieved with ultrasonic signals

02

Earphone wearers can demodulate their selected stream while remaining aware of ambient sounds

03

System preserves spatial audio perception and is inaudible to others

Abstract

We present a demo of UltrasonicSpheres, a novel system for location-specific audio delivery using wearable earphones that decode ultrasonic signals into audible sound. Unlike conventional beamforming setups, UltrasonicSpheres relies on single ultrasonic speakers to broadcast localized audio with multiple channels, each encoded on a distinct ultrasonic carrier frequency. Users wearing our acoustically transparent earphones can demodulate their selected stream, such as exhibit narrations in a chosen language, while remaining fully aware of ambient environmental sounds. The experience preserves spatial audio perception, giving the impression that the sound originates directly from the physical location of the source. This enables personalized, localized audio without requiring pairing, tracking, or additional infrastructure. Importantly, visitors not equipped with the earphones are…
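The idea of carrying several audio channels on distinct ultrasonic carriers and letting the earphone pick one can be illustrated with a small simulation. The sketch below is not the authors' implementation: the abstract does not state the modulation scheme, carrier frequencies, or sample rate, so amplitude modulation with product detection, the 25 kHz and 35 kHz carriers, the 192 kHz sample rate, the channel names, and all function names are assumptions chosen for illustration. Test tones stand in for the narration audio.

```python
import numpy as np

FS = 192_000          # sample rate in Hz (assumed; must exceed twice the highest carrier)
DURATION = 0.05       # seconds of signal to simulate
CARRIERS = {"en": 25_000.0, "de": 35_000.0}  # hypothetical carrier per channel, in Hz
TONES = {"en": 440.0, "de": 880.0}           # stand-in audio: one test tone per channel

def lowpass_fft(x, cutoff_hz, fs=FS):
    """Zero all FFT bins above cutoff_hz (an idealized brick-wall low-pass)."""
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    spectrum[freqs > cutoff_hz] = 0.0
    return np.fft.irfft(spectrum, n=len(x))

def modulate(audio, carrier_hz, t, depth=0.8):
    """Assumed scheme: classic AM, where the carrier amplitude follows the audio."""
    return (1.0 + depth * audio) * np.cos(2 * np.pi * carrier_hz * t)

def demodulate(received, carrier_hz, t, audio_bw_hz=4_000.0):
    """Product detection: mix down with the carrier, low-pass, drop the DC offset."""
    mixed = received * 2.0 * np.cos(2 * np.pi * carrier_hz * t)
    baseband = lowpass_fft(mixed, audio_bw_hz)
    return baseband - np.mean(baseband)

def dominant_frequency(x, fs=FS):
    """Return the frequency (Hz) of the strongest non-DC FFT bin."""
    mags = np.abs(np.fft.rfft(x))
    mags[0] = 0.0
    return np.fft.rfftfreq(len(x), d=1.0 / fs)[np.argmax(mags)]

t = np.arange(int(FS * DURATION)) / FS
audio = {name: np.sin(2 * np.pi * f * t) for name, f in TONES.items()}

# One speaker broadcasts the sum of all modulated channels.
broadcast = sum(modulate(audio[name], CARRIERS[name], t) for name in CARRIERS)

# Each listener recovers only the channel whose carrier they select.
recovered = {name: demodulate(broadcast, CARRIERS[name], t) for name in CARRIERS}

for name in CARRIERS:
    print(name, dominant_frequency(recovered[name]))
```

In this toy setup the broadcast signal has no energy in the audible band, and selecting a carrier recovers only that channel's tone. A real system would also have to deal with speaker and microphone frequency response, carrier synchronization, and attenuation of ultrasound in air, none of which is modeled here.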

Taxonomy

Topics: Speech and Audio Processing