Learning Compositional Representation for 4D Captures with Neural ODE

Boyan Jiang; Yinda Zhang; Xingkui Wei; Xiangyang Xue; Yanwei Fu

arXiv:2103.08271·cs.CV·April 21, 2021·1 cites

Learning Compositional Representation for 4D Captures with Neural ODE

Boyan Jiang, Yinda Zhang, Xingkui Wei, Xiangyang Xue, Yanwei Fu

PDF

Open Access

TL;DR

This paper introduces a novel compositional 4D representation using neural ODEs to model deforming 3D objects over time, enabling better reconstruction, motion transfer, and completion.

Contribution

It proposes a disentangled 4D representation with a neural ODE for motion, and an Identity Exchange Training strategy for effective component decoupling.

Findings

01

Outperforms state-of-the-art methods on 4D reconstruction

02

Improves motion transfer and completion tasks

03

Effectively disentangles shape, initial state, and motion

Abstract

Learning based representation has become the key to the success of many computer vision systems. While many 3D representations have been proposed, it is still an unaddressed problem how to represent a dynamically changing 3D object. In this paper, we introduce a compositional representation for 4D captures, i.e. a deforming 3D object over a temporal span, that disentangles shape, initial state, and motion respectively. Each component is represented by a latent code via a trained encoder. To model the motion, a neural Ordinary Differential Equation (ODE) is trained to update the initial state conditioned on the learned motion code, and a decoder takes the shape code and the updated state code to reconstruct the 3D model at each time stamp. To this end, we propose an Identity Exchange Training (IET) strategy to encourage the network to learn effectively decoupling each component.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Human Pose and Action Recognition