Class-agnostic Reconstruction of Dynamic Objects from Videos

Zhongzheng Ren; Xiaoming Zhao; Alexander G. Schwing

arXiv:2112.02091·cs.CV·December 6, 2021·6 cites

Class-agnostic Reconstruction of Dynamic Objects from Videos

Zhongzheng Ren, Xiaoming Zhao, Alexander G. Schwing

PDF

Open Access 1 Video

TL;DR

REDO is a unified, class-agnostic framework that reconstructs complete dynamic object shapes from videos, handling occlusion, various motions, and multiple categories using novel 4D implicit and transformation modules.

Contribution

The paper introduces REDO, a novel framework with two modules for class-agnostic dynamic object reconstruction from videos, addressing occlusion, diverse motions, and multiple categories.

Findings

01

REDO outperforms state-of-the-art methods on synthetic and real-world datasets.

02

The canonical 4D implicit function effectively captures object shape and appearance.

03

The 4D transformation module accurately models object dynamics.

Abstract

We introduce REDO, a class-agnostic framework to REconstruct the Dynamic Objects from RGBD or calibrated videos. Compared to prior work, our problem setting is more realistic yet more challenging for three reasons: 1) due to occlusion or camera settings an object of interest may never be entirely visible, but we aim to reconstruct the complete shape; 2) we aim to handle different object dynamics including rigid motion, non-rigid motion, and articulation; 3) we aim to reconstruct different categories of objects with one unified framework. To address these challenges, we develop two novel modules. First, we introduce a canonical 4D implicit function which is pixel-aligned with aggregated temporal visual cues. Second, we develop a 4D transformation module which captures object dynamics to support temporal propagation and aggregation. We study the efficacy of REDO in extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Class-agnostic Reconstruction of Dynamic Objects from Videos· slideslive

Taxonomy

TopicsAdvanced Vision and Imaging · Optical measurement and interference techniques · Human Pose and Action Recognition