Representation Learning for Audio Privacy Preservation using Source Separation and Robust Adversarial Learning

Diep Luong; Minh Tran; Shayan Gharib; Konstantinos Drossos; Tuomas Virtanen

arXiv:2308.04960·cs.SD·May 5, 2025

Open Access

TL;DR

This paper introduces a novel audio privacy preservation method combining source separation and adversarial learning to protect speech data in acoustic monitoring systems while maintaining task performance.

Contribution

The study proposes an integrated approach that enhances speech privacy preservation by combining source separation with adversarial representation learning, a novel combination in this context.

Findings

01 Significantly improves speech privacy preservation over baseline methods.

02 Maintains high performance in acoustic monitoring tasks.

03 Effective in preventing differentiation between speech and non-speech recordings.

Abstract

Privacy preservation has long been a concern in smart acoustic monitoring systems, where speech can be passively recorded along with a target signal in the system's operating environment. In this study, we propose the integration of two commonly used approaches in privacy preservation: source separation and adversarial representation learning. The proposed system learns a latent representation of audio recordings that prevents differentiating between speech and non-speech recordings. Initially, the source separation network filters out some of the privacy-sensitive data, and during the adversarial learning process, the system learns a privacy-preserving representation of the filtered signal. We demonstrate the effectiveness of our proposed method by comparing it against systems without source separation, without adversarial learning, and without both. Overall, our…
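The adversarial part of the abstract can be illustrated with a common form of adversarial objective, in which the encoder minimizes its task loss minus a weighted adversary loss. This is only a minimal sketch under stated assumptions: the excerpt does not give the paper's actual losses, its "robust" variant, or its network architecture, and the source separation stage is not modeled here. The names `encoder_objective` and `lam`, and all numbers, are hypothetical.

```python
import math


def binary_cross_entropy(p, y, eps=1e-12):
    """Binary cross-entropy for one predicted probability p and a 0/1 label y."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))


def mean_bce(probs, labels):
    """Average binary cross-entropy over a batch."""
    return sum(binary_cross_entropy(p, y) for p, y in zip(probs, labels)) / len(labels)


def encoder_objective(task_loss, adversary_loss, lam=1.0):
    """Hypothetical encoder loss: low task loss is good, high adversary loss is good.

    The adversary tries to tell speech from non-speech using the latent
    representation; the encoder is rewarded when the adversary fails.
    """
    return task_loss - lam * adversary_loss


# Toy batch (assumed values): 1 = recording contains speech, 0 = no speech.
speech_labels = [1, 0, 1, 0]

# Adversary outputs P(speech) from two hypothetical latent representations.
confident_adversary = [0.9, 0.1, 0.8, 0.2]  # representation leaks speech presence
confused_adversary = [0.5, 0.5, 0.5, 0.5]   # representation hides speech presence

task_loss = 0.3  # assumed identical monitoring-task loss in both cases

leaky = encoder_objective(task_loss, mean_bce(confident_adversary, speech_labels))
private = encoder_objective(task_loss, mean_bce(confused_adversary, speech_labels))

print(f"leaky representation:   {leaky:.3f}")
print(f"private representation: {private:.3f}")
```

With the task loss held equal, the representation that leaves the adversary at chance level yields the lower encoder objective, which is the behavior the abstract describes as preventing differentiation between speech and non-speech recordings.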

Taxonomy

Topics: Speech and Audio Processing · Music and Audio Processing · Geophysical Methods and Applications