Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method
Candy Olivia Mawalim, Masashi Unoki

TL;DR
This paper introduces a novel speech watermarking approach to enhance security in McAdams coefficient-based speaker anonymization, effectively improving privacy protection while maintaining speech quality and robustness.
Contribution
It proposes a new watermarking method that embeds binary information into the McAdams coefficient, enhancing security and anonymization performance in speaker privacy systems.
Findings
Successfully satisfies blind detection, inaudibility, and robustness in watermarking.
Significantly improves anonymization performance over existing systems.
Demonstrates effectiveness through objective evaluations aligned with VP2020 and IHC standards.
Abstract
Speaker anonymization aims to suppress speaker individuality to protect privacy in speech while preserving the other aspects, such as speech content. One effective solution for anonymization is to modify the McAdams coefficient. In this work, we propose a method to improve the security for speaker anonymization based on the McAdams coefficient by using a speech watermarking approach. The proposed method consists of two main processes: one for embedding and one for detection. In embedding process, two different McAdams coefficients represent binary bits ``0" and ``1". The watermarked speech is then obtained by frame-by-frame bit inverse switching. Subsequently, the detection process is carried out by a power spectrum comparison. We conducted objective evaluations with reference to the VoicePrivacy 2020 Challenge (VP2020) and of the speech watermarking with reference to the Information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
