Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training
Xiao Lu, Yihong Cao, Sheng Liu, Chengjiang Long, Zipei Chen, Xuanyu, Zhou, Yimin Yang, Chunxia Xiao

TL;DR
This paper introduces a novel Spatio-Temporal Interpolation Consistency Training framework for video shadow detection that leverages unlabeled video frames and labeled images to improve accuracy and temporal consistency.
Contribution
It proposes a new training framework with spatial and temporal interpolation consistency constraints, along with a scale-aware network for multi-scale shadow learning, enhancing video shadow detection without requiring video labels.
Findings
Outperforms most state-of-the-art supervised, semi-supervised, and unsupervised methods.
Effective in improving temporal consistency in shadow detection.
Validated on ViSha and a self-annotated dataset.
Abstract
It is challenging to annotate large-scale datasets for supervised video shadow detection methods. Using a model trained on labeled images to the video frames directly may lead to high generalization error and temporal inconsistent results. In this paper, we address these challenges by proposing a Spatio-Temporal Interpolation Consistency Training (STICT) framework to rationally feed the unlabeled video frames together with the labeled images into an image shadow detection network training. Specifically, we propose the Spatial and Temporal ICT, in which we define two new interpolation schemes, \textit{i.e.}, the spatial interpolation and the temporal interpolation. We then derive the spatial and temporal interpolation consistency constraints accordingly for enhancing generalization in the pixel-wise classification task and for encouraging temporal consistent predictions, respectively. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Gait Recognition and Analysis
