Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement

Chunming He; Fengyang Xiao; Rihan Zhang; Chengyu Fang; Deng-Ping Fan; Sina Farsiu

arXiv:2508.15027·cs.CV·August 22, 2025

Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement

Chunming He, Fengyang Xiao, Rihan Zhang, Chengyu Fang, Deng-Ping Fan, Sina Farsiu

PDF

Open Access

TL;DR

This paper introduces RUN++, a reversible unfolding network with generative refinement for concealed visual perception, effectively integrating reversible modeling and diffusion techniques to improve accuracy in mask and RGB domains under real-world conditions.

Contribution

The paper presents a novel reversible unfolding network that combines optimization, reversible modeling, and diffusion refinement for improved concealed visual perception.

Findings

01

Effective identification of concealed objects in mask and RGB domains.

02

Significant reduction in false positives and negatives.

03

Robust performance under real-world degradations.

Abstract

Existing methods for concealed visual perception (CVP) often leverage reversible strategies to decrease uncertainty, yet these are typically confined to the mask domain, leaving the potential of the RGB domain underexplored. To address this, we propose a reversible unfolding network with generative refinement, termed RUN++. Specifically, RUN++ first formulates the CVP task as a mathematical optimization problem and unfolds the iterative solution into a multi-stage deep network. This approach provides a principled way to apply reversible modeling across both mask and RGB domains while leveraging a diffusion model to resolve the resulting uncertainty. Each stage of the network integrates three purpose-driven modules: a Concealed Object Region Extraction (CORE) module applies reversible modeling to the mask domain to identify core object regions; a Context-Aware Region Enhancement (CARE)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace Recognition and Perception · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis