NEC: Speaker Selective Cancellation via Neural Enhanced Ultrasound   Shadowing

Hanqing Guo; Chenning Li; Lingkun Li; Zhichao Cao; Qiben Yan; Li Xiao

arXiv:2207.05848·cs.SD·July 14, 2022

NEC: Speaker Selective Cancellation via Neural Enhanced Ultrasound Shadowing

Hanqing Guo, Chenning Li, Lingkun Li, Zhichao Cao, Qiben Yan, Li Xiao

PDF

Open Access

TL;DR

NEC introduces a neural-enhanced ultrasound shadowing technique that selectively cancels a target speaker's voice in real-time, preventing unauthorized microphone capture without affecting other conversations.

Contribution

The paper presents a novel neural network-based method that modulates ultrasound to selectively cancel a target speaker's voice, improving privacy and security in audio recordings.

Findings

01

Effective real-time voice cancellation demonstrated on smartphones.

02

Selective cancellation without interfering with other speakers.

03

Utilizes ultrasound modulation and microphone non-linearity for accuracy.

Abstract

In this paper, we propose NEC (Neural Enhanced Cancellation), a defense mechanism, which prevents unauthorized microphones from capturing a target speaker's voice. Compared with the existing scrambling-based audio cancellation approaches, NEC can selectively remove a target speaker's voice from a mixed speech without causing interference to others. Specifically, for a target speaker, we design a Deep Neural Network (DNN) model to extract high-level speaker-specific but utterance-independent vocal features from his/her reference audios. When the microphone is recording, the DNN generates a shadow sound to cancel the target voice in real-time. Moreover, we modulate the audible shadow sound onto an ultrasound frequency, making it inaudible for humans. By leveraging the non-linearity of the microphone circuit, the microphone can accurately decode the shadow sound for target voice…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Advanced Adaptive Filtering Techniques · Speech Recognition and Synthesis