Invisible Watermarking for Audio Generation Diffusion Models

Xirong Cao; Xiang Li; Divyesh Jadav; Yanzhao Wu; Zhehui Chen; Chen Zeng; Wenqi Wei

arXiv:2309.13166·cs.SD·November 2, 2023

TL;DR

This paper introduces an invisible watermarking technique for audio diffusion models trained on mel-spectrograms, enabling model verification and ownership protection without compromising audio generation quality.

Contribution

The paper is the first to apply watermarking to audio diffusion models, providing a method for safeguarding model integrity and copyright in audio data generation.

Findings

01. Watermark triggers effectively protect against unauthorized modifications.

02. The watermarking method maintains high audio generation quality.

03. The approach enables reliable model ownership verification.

Abstract

Diffusion models have gained prominence in the image domain for their capabilities in data generation and transformation, and they achieve state-of-the-art performance in various tasks in both image and audio domains. In the rapidly evolving field of audio-based machine learning, safeguarding model integrity and establishing data copyright are of paramount importance. This paper presents the first watermarking technique applied to audio diffusion models trained on mel-spectrograms, offering a novel approach to these challenges. Our model not only excels in benign audio generation but also incorporates an invisible watermarking trigger mechanism for model verification. This watermark trigger serves as a protective layer, enabling the identification of model ownership and ensuring its integrity. Through extensive experiments, we demonstrate that invisible watermark triggers can…
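
To make the idea of an "invisible" trigger more concrete, here is a minimal sketch in Python. It is not the authors' method: the abstract does not say how the trigger is constructed, so the low-amplitude blending scheme, the function names `blend_trigger` and `trigger_score`, the blend weight `alpha`, and the 80 x 128 array size are all assumptions made for illustration. The sketch only shows that a small blend weight changes each value of a spectrogram-shaped input slightly, while an owner who knows the secret pattern can still detect it through correlation.

```python
import numpy as np


def blend_trigger(x, trigger, alpha=0.05):
    """Blend a fixed trigger pattern into x with a small weight alpha.

    Hypothetical scheme for illustration; not taken from the paper.
    """
    return (1.0 - alpha) * x + alpha * trigger


def trigger_score(x, trigger):
    """Normalized correlation between an input and the trigger pattern."""
    x0 = x - x.mean()
    t0 = trigger - trigger.mean()
    return float((x0 * t0).sum() / (np.linalg.norm(x0) * np.linalg.norm(t0)))


rng = np.random.default_rng(0)
shape = (80, 128)  # (mel bins, frames): an assumed spectrogram size

noise = rng.standard_normal(shape)    # stand-in for a diffusion model input
trigger = rng.standard_normal(shape)  # hypothetical secret pattern kept by the owner
triggered = blend_trigger(noise, trigger, alpha=0.05)

# How far the triggered input drifts from the clean one, element-wise.
max_change = float(np.abs(triggered - noise).max())

# The owner can tell the two inputs apart by correlating with the secret pattern.
clean_score = trigger_score(noise, trigger)
marked_score = trigger_score(triggered, trigger)

print(f"max element change: {max_change:.3f}")
print(f"clean score: {clean_score:.4f}, triggered score: {marked_score:.4f}")
```

In a verification setting, the triggered input would be fed to the model and the owner would check for a predefined response; that step depends on the trained model and is outside the scope of this sketch.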

Taxonomy

Topics: Music and Audio Processing · Generative Adversarial Networks and Image Synthesis · Music Technology and Sound Studies

Methods: Diffusion