Safeguarding Privacy in Edge Speech Understanding with Tiny Foundation Models
Afsara Benazir, Felix Xiaozhu Lin

TL;DR
SpeechShield is a novel privacy-preserving speech inference engine that uses tiny foundation models on resource-constrained devices to filter sensitive content without sacrificing transcription accuracy, achieving state-of-the-art performance.
Contribution
The paper introduces SpeechShield, the first system leveraging tiny speech foundation models for on-device privacy filtering in speech recognition, with significant efficiency improvements.
Findings
Filters 83% of private entities on-device
Achieves state-of-the-art transcription with <100MB memory
Reduces WER by up to 77.5% compared to existing methods
Abstract
Robust speech recognition systems rely on cloud service providers for inference. It needs to ensure that an untrustworthy provider cannot deduce the sensitive content in speech. Sanitization can be done on speech content keeping in mind that it has to avoid compromising transcription accuracy. Realizing the under utilized capabilities of tiny speech foundation models (FMs), for the first time, we propose a novel use: enhancing speech privacy on resource-constrained devices. We introduce SpeechShield, an edge/cloud privacy preserving speech inference engine that can filter sensitive entities without compromising transcript accuracy. We utilize a timestamp based on-device masking approach that utilizes a token to entity prediction model to filter sensitive entities. Our choice of mask strategically conceals parts of the input and hides sensitive data. The masked input is sent to a trusted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsChaos-based Image/Signal Encryption · Privacy-Preserving Technologies in Data
Methodstravel james
