Timeline and Boundary Guided Diffusion Network for Video Shadow   Detection

Haipeng Zhou; Honqiu Wang; Tian Ye; Zhaohu Xing; Jun Ma; Ping Li,; Qiong Wang; Lei Zhu

arXiv:2408.11785·cs.CV·August 22, 2024

Timeline and Boundary Guided Diffusion Network for Video Shadow Detection

Haipeng Zhou, Honqiu Wang, Tian Ye, Zhaohu Xing, Jun Ma, Ping Li,, Qiong Wang, Lei Zhu

PDF

1 Repo

TL;DR

This paper introduces a novel diffusion-based network for video shadow detection that effectively integrates temporal guidance and boundary information, significantly improving detection accuracy over existing methods.

Contribution

It is the first to apply a diffusion model to VSD, incorporating a dual scale aggregation and boundary-aware attention to enhance temporal and boundary feature learning.

Findings

01

Outperforms state-of-the-art methods in VSD accuracy

02

Effectively captures temporal and boundary features

03

Demonstrates the effectiveness of diffusion models in VSD

Abstract

Video Shadow Detection (VSD) aims to detect the shadow masks with frame sequence. Existing works suffer from inefficient temporal learning. Moreover, few works address the VSD problem by considering the characteristic (i.e., boundary) of shadow. Motivated by this, we propose a Timeline and Boundary Guided Diffusion (TBGDiff) network for VSD where we take account of the past-future temporal guidance and boundary information jointly. In detail, we design a Dual Scale Aggregation (DSA) module for better temporal understanding by rethinking the affinity of the long-term and short-term frames for the clipped video. Next, we introduce Shadow Boundary Aware Attention (SBAA) to utilize the edge contexts for capturing the characteristics of shadows. Moreover, we are the first to introduce the Diffusion model for VSD in which we explore a Space-Time Encoded Embedding (STEE) to inject the temporal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haipengzhou856/tbgdiff
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Attention Is All You Need · Diffusion · Attentive Walk-Aggregating Graph Neural Network