SDDNet: Style-guided Dual-layer Disentanglement Network for Shadow Detection

Runmin Cong, Yuchen Guan, Jinpeng Chen, Wei Zhang, Yao Zhao, and Sam Kwong

arXiv:2308.08935 · cs.CV · December 10, 2024

TL;DR

SDDNet treats a shadow image as a composition of a background layer and a shadow layer and disentangles the features of the two layers under style guidance. This improves detection accuracy on complex backgrounds while keeping inference real-time (32 FPS).

Contribution

The paper proposes a dual-layer disentanglement network with two modules, Feature Separation and Recombination (FSR) and Shadow Style Filter (SSF), to reduce background color interference in shadow detection.

Findings

01. Outperforms existing methods on three public datasets

02. Achieves real-time inference at 32 FPS

03. Effectively reduces background color interference
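To put the 32 FPS figure in context, the following minimal sketch converts a frame rate into a per-frame time budget and times an arbitrary per-frame callable. The helper names `frame_budget_ms` and `measure_fps` are hypothetical, and the summary does not state the hardware or measurement protocol behind the reported number, so this is only one plausible way to check throughput.

```python
import time
from typing import Callable


def frame_budget_ms(fps: float) -> float:
    """Return the time available per frame, in milliseconds, at a given frame rate."""
    return 1000.0 / fps


def measure_fps(process_frame: Callable[[], None],
                n_frames: int = 50,
                warmup: int = 5) -> float:
    """Estimate frames per second of a per-frame callable (hypothetical helper).

    A few warm-up calls are made first so one-off setup costs are not timed.
    """
    for _ in range(warmup):
        process_frame()
    start = time.perf_counter()
    for _ in range(n_frames):
        process_frame()
    elapsed = time.perf_counter() - start
    return n_frames / elapsed


if __name__ == "__main__":
    # At 32 FPS, each frame must be processed within this many milliseconds.
    print(frame_budget_ms(32))
    # Placeholder workload; a real check would run one model forward pass here.
    print(measure_fps(lambda: sum(range(1000))))
```

In a real measurement, `process_frame` would wrap one forward pass of the detector on a fixed-size input.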

Abstract

Despite significant progress in shadow detection, current methods still struggle with the adverse impact of background color, which may lead to errors when shadows are present on complex backgrounds. Drawing inspiration from the human visual system, we treat the input shadow image as a composition of a background layer and a shadow layer, and design a Style-guided Dual-layer Disentanglement Network (SDDNet) to model these layers independently. To achieve this, we devise a Feature Separation and Recombination (FSR) module that decomposes multi-level features into shadow-related and background-related components by offering specialized supervision for each component, while preserving information integrity and avoiding redundancy through the reconstruction constraint. Moreover, we propose a Shadow Style Filter (SSF) module to guide the feature disentanglement by focusing on style…
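The separation-and-recombination idea can be illustrated with a toy, pure-Python sketch. It is not the authors' FSR module: the per-channel gates stand in for learned branches, and the names `separate`, `recombine`, and `reconstruction_loss` are hypothetical. It only shows why a reconstruction constraint penalizes both lost information and duplicated information when a feature is split into a shadow part and a background part.

```python
from typing import List, Tuple

Vector = List[float]


def separate(features: Vector,
             shadow_gate: Vector,
             background_gate: Vector) -> Tuple[Vector, Vector]:
    """Split features into shadow and background parts.

    Assumption: each branch simply scales every channel by its own gate,
    a stand-in for whatever learned layers a real network would use.
    """
    shadow = [f * g for f, g in zip(features, shadow_gate)]
    background = [f * g for f, g in zip(features, background_gate)]
    return shadow, background


def recombine(shadow: Vector, background: Vector) -> Vector:
    """Recombine the two parts by element-wise addition (an assumed choice)."""
    return [s + b for s, b in zip(shadow, background)]


def reconstruction_loss(original: Vector, rebuilt: Vector) -> float:
    """Mean squared error between the original and the recombined features."""
    return sum((o - r) ** 2 for o, r in zip(original, rebuilt)) / len(original)


features = [0.8, 0.2, 0.5, 0.9]

# Complementary gates: the parts share the information with no overlap.
s, b = separate(features, [1.0, 0.0, 0.5, 0.25], [0.0, 1.0, 0.5, 0.75])
loss_complementary = reconstruction_loss(features, recombine(s, b))

# Redundant gates: both branches keep everything, so the sum double-counts.
s, b = separate(features, [1.0] * 4, [1.0] * 4)
loss_redundant = reconstruction_loss(features, recombine(s, b))

# Lossy gates: both branches discard everything.
s, b = separate(features, [0.0] * 4, [0.0] * 4)
loss_lossy = reconstruction_loss(features, recombine(s, b))

print(loss_complementary, loss_redundant, loss_lossy)
```

Only the complementary split drives the loss to (approximately) zero. A reconstruction term of this kind therefore encourages the two components to jointly retain the input without duplicating it, while the specialized supervision mentioned in the abstract decides which component holds what.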

Code & Models

Repositories

rmcong/sddnet_acmmm23 (PyTorch, official)

Taxonomy

Topics: Video Surveillance and Tracking Methods · Face recognition and analysis · Remote-Sensing Image Classification

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings