Stacked Conditional Generative Adversarial Networks for Jointly Learning Shadow Detection and Shadow Removal
Jifeng Wang, Xiang Li, Le Hui, Jian Yang

TL;DR
This paper introduces a novel multi-task framework using stacked conditional GANs to jointly detect and remove shadows from images, improving performance through mutual benefits and global scene understanding.
Contribution
It proposes the first end-to-end multi-task shadow detection and removal framework with a novel stacked CGAN architecture that enhances both tasks mutually.
Findings
Outperforms state-of-the-art methods on multiple datasets.
Constructed the first large-scale shadow dataset with 1870 triplets.
Demonstrates significant improvements in shadow detection and removal accuracy.
Abstract
Understanding shadows from a single image spontaneously derives into two types of task in previous studies, containing shadow detection and shadow removal. In this paper, we present a multi-task perspective, which is not embraced by any existing work, to jointly learn both detection and removal in an end-to-end fashion that aims at enjoying the mutually improved benefits from each other. Our framework is based on a novel STacked Conditional Generative Adversarial Network (ST-CGAN), which is composed of two stacked CGANs, each with a generator and a discriminator. Specifically, a shadow image is fed into the first generator which produces a shadow detection mask. That shadow image, concatenated with its predicted mask, goes through the second generator in order to recover its shadow-free image consequently. In addition, the two corresponding discriminators are very likely to model higher…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
