EDGE-Shield: Efficient Denoising-staGE Shield for Violative Content Filtering via Scalable Reference-Based Matching

Takara Taniguchi; Ryohei Shimizu; Duc Minh Vo; Kota Izumi; Shiqi Yang; Teppei Suzuki

arXiv:2604.06063·cs.CV·April 15, 2026

EDGE-Shield: Efficient Denoising-staGE Shield for Violative Content Filtering via Scalable Reference-Based Matching

Takara Taniguchi, Ryohei Shimizu, Duc Minh Vo, Kota Izumi, Shiqi Yang, Teppei Suzuki

PDF

TL;DR

EDGE-Shield is a scalable, training-free content filtering method for Text-to-Image models that efficiently blocks violative content during denoising, significantly reducing processing time while maintaining accuracy.

Contribution

It introduces an embedding-based matching approach and an $x$-pred transformation to improve scalability and early-stage classification in content filtering.

Findings

01

Achieves approximately 79% reduction in processing time for Z-Image-Turbo.

02

Achieves approximately 50% reduction in processing time for Qwen-Image.

03

Maintains filtering accuracy across different generative model architectures.

Abstract

The advent of Text-to-Image generative models poses significant risks of copyright violation and deepfake generation. Since the rapid proliferation of new copyrighted works and private individuals constantly emerges, reference-based training-free content filters are essential for providing up-to-date protection without the constraints of a fixed knowledge cutoff. However, existing reference-based approaches often lack scalability when handling numerous references and require waiting for finishing image generation. To solve these problems, we propose EDGE-Shield, a scalable content filter during the denoising process that maintains practical latency while effectively blocking violative content. We leverage embedding-based matching for efficient reference comparison. Additionally, we introduce an \textit{ $x$ }-pred transformation that converts the model's noisy intermediate latent into the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.