Multiplexing Neural Audio Watermarks
Zheqi Yuan, Yucheng Huang, Guangzhi Sun, Zengrui Jin, Chao Zhang

TL;DR
This paper introduces a multiplexing approach for neural audio watermarking that combines multiple techniques to improve robustness against distortions and attacks, outperforming existing methods on standard datasets.
Contribution
It proposes a novel multiplexing paradigm with PA-TFM and MaskNet, enhancing robustness and performance in neural audio watermarking.
Findings
PA-TFM and MaskNet outperform single-watermark methods.
The approach is robust against 14 diverse attack types.
Experimental results on LibriSpeech and Common Voice datasets.
Abstract
Audio watermarking is essential for verifying speech authenticity, yet single-watermark schemes often struggle against sophisticated distortions such as neural reconstruction and adversarial attacks. To address this limitation, we introduce a multiplexing paradigm that combines multiple watermarking techniques to leverage their inherent complementarities. We explore both parallel and sequential multiplexing strategies and propose perceptual-adaptive time-frequency multiplexing (PA-TFM), a robust training-free approach. To further enhance performance, we introduce MaskNet, a novel model-based framework designed to learn effective time-domain multiplexing. Experimental results on the LibriSpeech and Common Voice datasets under 14 diverse attack types, including high-strength white-box and neural reconstruction attacks, demonstrate that both PA-TFM and MaskNet considerably outperform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning
