FlashClear: Ultra-Fast Image Content Removal via Efficient Step Distillation and Feature Caching

Yixin Tang; Jiawei Guo; Junxian Li; Zhiteng Li; Jixin Zhao; Bingya Zhang; Chenbo Wang; Yulun Zhang; Shangchen Zhou

arXiv:2605.09003·cs.CV·May 13, 2026

FlashClear: Ultra-Fast Image Content Removal via Efficient Step Distillation and Feature Caching

Yixin Tang, Jiawei Guo, Junxian Li, Zhiteng Li, Jixin Zhao, Bingya Zhang, Chenbo Wang, Yulun Zhang, Shangchen Zhou

PDF

TL;DR

FlashClear is a highly efficient diffusion-based image content removal method that significantly accelerates inference by focusing on foreground regions, achieving up to 122x speedup without sacrificing quality.

Contribution

The paper introduces RAD and FPAC strategies for rapid, region-aware diffusion-based object removal, reducing computational cost and inference time.

Findings

01

FlashClear achieves up to 8.26× speedup over ObjectClear.

02

FlashClear maintains high visual quality and fidelity.

03

The framework outperforms existing methods on the OBER benchmark.

Abstract

Recently, diffusion-based object removal models have achieved impressive results in eliminating objects and their associated visual effects. However, they indiscriminately denoise all tokens across all timesteps, ignoring that removal usually involves small foreground regions. This strategy introduces substantial computational overhead and prolonged inference times. To overcome this computational burden, we propose a latent discriminator to implement Region-aware Adversarial Distillation (RAD), yielding a highly efficient few-step model named FlashClear. Furthermore, tailored to few-step diffusion models, we propose FPAC (Foreground-Prioritized Asymmetric Attention and Caching), a training-free acceleration strategy. Extensive experiments demonstrate that our framework provides massive acceleration while maintaining or exceeding the performance of our base model, ObjectClear. Notably,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.