DeepForgeSeal: Latent Space-Driven Semi-Fragile Watermarking for Deepfake Detection Using Multi-Agent Adversarial Reinforcement Learning

Tharindu Fernando; Clinton Fookes; Sridha Sridharan

arXiv:2511.04949·cs.CV·November 10, 2025

DeepForgeSeal: Latent Space-Driven Semi-Fragile Watermarking for Deepfake Detection Using Multi-Agent Adversarial Reinforcement Learning

Tharindu Fernando, Clinton Fookes, Sridha Sridharan

PDF

Open Access

TL;DR

DeepForgeSeal introduces a novel latent space-based semi-fragile watermarking framework utilizing multi-agent adversarial reinforcement learning to improve deepfake detection robustness and adaptability.

Contribution

It proposes a learnable watermark embedder in latent space combined with MAARL to balance robustness and fragility against manipulations.

Findings

01

Outperforms state-of-the-art methods on CelebA and CelebA-HQ benchmarks.

02

Achieves over 4.5% and 5.3% improvements under challenging scenarios.

03

Demonstrates effective high-level semantic encoding in watermarking.

Abstract

Rapid advances in generative AI have led to increasingly realistic deepfakes, posing growing challenges for law enforcement and public trust. Existing passive deepfake detectors struggle to keep pace, largely due to their dependence on specific forgery artifacts, which limits their ability to generalize to new deepfake types. Proactive deepfake detection using watermarks has emerged to address the challenge of identifying high-quality synthetic media. However, these methods often struggle to balance robustness against benign distortions with sensitivity to malicious tampering. This paper introduces a novel deep learning framework that harnesses high-dimensional latent space representations and the Multi-Agent Adversarial Reinforcement Learning (MAARL) paradigm to develop a robust and adaptive watermarking approach. Specifically, we develop a learnable watermark embedder that operates in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning