Asymmetric Phase Coding Audio Watermarking
Guang Yang, Amir Ghasemian, Ninareh Mehrabi, Homa Hosseinmardi

TL;DR
This paper introduces Asymmetric Phase Coding (APC), a cryptographic audio watermarking method that is robust against various distortions and attacks, enabling reliable verification of audio provenance.
Contribution
APC is a novel, training-free cryptographic signing layer for audio that combines digital signatures, error correction, and phase-based encoding for robust, non-repudiable watermarks.
Findings
Achieves 97.5% to 98.3% verification accuracy under various distortions.
Maintains mean PESQ of 3.02, indicating good audio quality.
Offers fast verification with tens of milliseconds CPU latency.
Abstract
The proliferation of deepfake audio challenges voice-based authentication systems; passive forensic detectors are sensitive to evolving generative models and to real-world channel distortions. We propose Asymmetric Phase Coding (APC), a training-free cryptographic signing layer for audio, designed as a compact and auditable provenance primitive that can stand alone or be stacked with learned watermarks. APC combines Ed25519 digital signatures (EdDSA, FIPS 186-5; 64-byte signatures) with Reed-Solomon error correction, pseudo-random STFT phase-bin selection, and a redundant quantization-index-modulation (QIM) code on log-magnitude differences of adjacent bin pairs, yielding a compact, non-repudiable, blind-extractable watermark. We evaluate APC on 1,000 LibriSpeech test-clean clips (10 s each, 44.1 kHz) under eight attack configurations -- identity, 10% end-cropping, 20% end-cropping, 8…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
