Training Large ASR Encoders with Differential Privacy

Geeticka Chauhan; Steve Chien; Om Thakkar; Abhradeep Thakurta; Arun; Narayanan

arXiv:2409.13953·cs.SD·September 24, 2024

Training Large ASR Encoders with Differential Privacy

Geeticka Chauhan, Steve Chien, Om Thakkar, Abhradeep Thakurta, Arun, Narayanan

PDF

Open Access

TL;DR

This paper introduces the first application of differential privacy to self-supervised learning for large speech models, demonstrating effective privacy-utility trade-offs in ASR tasks.

Contribution

It pioneers the use of DP in SSL for ASR, proposes gradient-based layer freezing for better privacy-utility balance, and evaluates on LibriSpeech with strong results.

Findings

01

Achieved low WER with DP on LibriSpeech

02

Introduced gradient-based layer freezing for privacy

03

Demonstrated robustness of DP SSL in ASR

Abstract

Self-supervised learning (SSL) methods for large speech models have proven to be highly effective at ASR. With the interest in public deployment of large pre-trained models, there is a rising concern for unintended memorization and leakage of sensitive data points from the training data. In this paper, we apply differentially private (DP) pre-training to a SOTA Conformer-based encoder, and study its performance on a downstream ASR task assuming the fine-tuning data is public. This paper is the first to apply DP to SSL for ASR, investigating the DP noise tolerance of the BEST-RQ pre-training method. Notably, we introduce a novel variant of model pruning called gradient-based layer freezing that provides strong improvements in privacy-utility-compute trade-offs. Our approach yields a LibriSpeech test-clean/other WER (%) of 3.78/ 8.41 with ( $10$ , 1e^-9)-DP for extrapolation towards low…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModular Robots and Swarm Intelligence · Wireless Body Area Networks · Context-Aware Activity Recognition Systems

MethodsPruning