ReVision : A Post-Hoc, Vision-Based Technique for Replacing Unacceptable Concepts in Image Generation Pipeline

Gurjot Singh; Prabhjot Singh; Aashima Sharma; Maninder Singh; Ryan Ko

arXiv:2602.19149·cs.CR·March 3, 2026

ReVision : A Post-Hoc, Vision-Based Technique for Replacing Unacceptable Concepts in Image Generation Pipeline

Gurjot Singh, Prabhjot Singh, Aashima Sharma, Maninder Singh, Ryan Ko

PDF

Open Access

TL;DR

ReVision is a training-free, post-hoc safety framework for image generation that detects and edits unsafe concepts in images, improving safety and scene fidelity without retraining the generator.

Contribution

ReVision introduces a novel, prompt-based, post-hoc safety method with a VLM-assisted spatial gating mechanism for precise unsafe concept editing in images.

Findings

01

Improves CLIP alignment toward safe prompts by +0.121 on average.

02

Significantly enhances multi-concept background fidelity (LPIPS 0.166 to 0.058).

03

Reduces recognizability of policy-violating content from 95.99% to 10.16%.

Abstract

Image-generative models are widely deployed across industries. Recent studies show that they can be exploited to produce policy-violating content. Existing mitigation strategies primarily operate at the pre- or mid-generation stages through techniques such as prompt filtering and safety-aware training/fine-tuning. Prior work shows that these approaches can be bypassed and often degrade generative quality. In this work, we propose ReVision, a training-free, prompt-based, post-hoc safety framework for image-generation pipeline. ReVision acts as a last-line defense by analyzing generated images and selectively editing unsafe concepts without altering the underlying generator. It uses the Gemini-2.5-Flash model as a generic policy-violating concept detector, avoiding reliance on multiple category-specific detectors, and performs localized semantic editing to replace unsafe content. Prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Ethics and Social Impacts of AI · Multimodal Machine Learning Applications