D$^2$NeRF: Self-Supervised Decoupling of Dynamic and Static Objects from a Monocular Video
Tianhao Wu, Fangcheng Zhong, Andrea Tagliasacchi, Forrester Cole,, Cengiz Oztireli

TL;DR
D$^2$NeRF is a self-supervised method that learns to separate dynamic objects and shadows from static backgrounds in monocular videos by representing them with decoupled neural radiance fields, improving 3D scene understanding.
Contribution
The paper introduces a novel self-supervised neural radiance field approach that decouples dynamic objects and shadows from static scenes using a new loss and shadow field network.
Findings
Outperforms state-of-the-art in decoupling dynamic/static objects
Effective shadow detection and removal
Improved image segmentation of moving objects
Abstract
Given a monocular video, segmenting and decoupling dynamic objects while recovering the static environment is a widely studied problem in machine intelligence. Existing solutions usually approach this problem in the image domain, limiting their performance and understanding of the environment. We introduce Decoupled Dynamic Neural Radiance Field (DNeRF), a self-supervised approach that takes a monocular video and learns a 3D scene representation which decouples moving objects, including their shadows, from the static background. Our method represents the moving objects and the static background by two separate neural radiance fields with only one allowing for temporal changes. A naive implementation of this approach leads to the dynamic component taking over the static one as the representation of the former is inherently more general and prone to overfitting. To this end, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
