Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal

Xiao Feng Zhang; Tian Yi Song; Jia Wei Yao

arXiv:2309.11715·cs.CV·August 27, 2025·1 cites

Deshadow-Anything: When Segment Anything Model Meets Zero-shot shadow removal

Xiao Feng Zhang, Tian Yi Song, Jia Wei Yao

PDF

Open Access

TL;DR

This paper introduces Deshadow-Anything, a method that combines the Segment Anything Model with diffusion techniques and novel guidance strategies to perform zero-shot shadow removal while preserving image details.

Contribution

It presents a new approach integrating SAM with diffusion models and innovative guidance methods for effective zero-shot shadow removal.

Findings

01

Effective shadow removal with preserved image details

02

Improved training speed via MSAG and DDPM-AIP

03

Superior performance on shadow removal benchmarks

Abstract

Segment Anything (SAM), an advanced universal image segmentation model trained on an expansive visual dataset, has set a new benchmark in image segmentation and computer vision. However, it faced challenges when it came to distinguishing between shadows and their backgrounds. To address this, we developed Deshadow-Anything, considering the generalization of large-scale datasets, and we performed Fine-tuning on large-scale datasets to achieve image shadow removal. The diffusion model can diffuse along the edges and textures of an image, helping to remove shadows while preserving the details of the image. Furthermore, we design Multi-Self-Attention Guidance (MSAG) and adaptive input perturbation (DDPM-AIP) to accelerate the iterative training speed of diffusion. Experiments on shadow removal tasks demonstrate that these methods can effectively improve image restoration performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques

MethodsDiffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings