Long-term Conversation Analysis: Exploring Utility and Privacy

Francesco Nespoli; Jule Pohlhausen; Patrick A. Naylor; Joerg Bitzer

arXiv:2306.16071·eess.AS·June 29, 2023

Long-term Conversation Analysis: Exploring Utility and Privacy

Francesco Nespoli, Jule Pohlhausen, Patrick A. Naylor, Joerg Bitzer

PDF

Open Access 1 Repo

TL;DR

This paper presents a privacy-preserving feature extraction method for long-term conversation analysis that balances utility and privacy using spectral smoothing and McAdams coefficient-based speaker anonymization.

Contribution

It introduces a novel combination of spectral smoothing and McAdams coefficient for privacy-preserving feature extraction in conversational data.

Findings

01

The method maintains utility in voice activity detection and speaker diarization.

02

It improves privacy protection in speech recognition and speaker verification.

03

The combination of techniques balances utility and privacy effectively.

Abstract

The analysis of conversations recorded in everyday life requires privacy protection. In this contribution, we explore a privacy-preserving feature extraction method based on input feature dimension reduction, spectral smoothing and the low-cost speaker anonymization technique based on McAdams coefficient. We assess the utility of the feature extraction methods with a voice activity detection and a speaker diarization system, while privacy protection is determined with a speech recognition and a speaker verification model. We show that the combination of McAdams coefficient and spectral smoothing maintains the utility while improving privacy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ol-mega/ppca
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing