DeepiSign: Invisible Fragile Watermark to Protect the Integrityand Authenticity of CNN
Alsharif Abuadbba, Hyoungshick Kim, Surya Nepal

TL;DR
DeepiSign introduces a fragile invisible watermarking technique to protect CNN models' integrity and authenticity against manipulation attacks, embedding secrets into less significant frequency domain coefficients with minimal accuracy impact.
Contribution
It presents a novel wavelet-based fragile watermarking method for CNNs that ensures model integrity without degrading performance, and demonstrates robustness against various attacks.
Findings
Successfully embeds up to 1KB secret per layer with minimal accuracy loss.
Verifies model integrity effectively against poisoning and fine-tuning attacks.
Maintains classification accuracy while providing tamper-proofing.
Abstract
Convolutional Neural Networks (CNNs) deployed in real-life applications such as autonomous vehicles have shown to be vulnerable to manipulation attacks, such as poisoning attacks and fine-tuning. Hence, it is essential to ensure the integrity and authenticity of CNNs because compromised models can produce incorrect outputs and behave maliciously. In this paper, we propose a self-contained tamper-proofing method, called DeepiSign, to ensure the integrity and authenticity of CNN models against such manipulation attacks. DeepiSign applies the idea of fragile invisible watermarking to securely embed a secret and its hash value into a CNN model. To verify the integrity and authenticity of the model, we retrieve the secret from the model, compute the hash value of the secret, and compare it with the embedded hash value. To minimize the effects of the embedded secret on the CNN model, we use a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
