S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton   of Articulated Objects from Single Monocular Video

Hao Zhang; Fang Li; Samyak Rawlekar; Narendra Ahuja

arXiv:2405.12607·cs.CV·May 22, 2024

S3O: A Dual-Phase Approach for Reconstructing Dynamic Shape and Skeleton of Articulated Objects from Single Monocular Video

Hao Zhang, Fang Li, Samyak Rawlekar, Narendra Ahuja

PDF

Open Access 1 Repo

TL;DR

S3O introduces a two-phase, annotation-free method for reconstructing dynamic articulated objects from monocular videos, improving accuracy and efficiency by decoupling shape and motion learning.

Contribution

The paper presents S3O, a novel dual-phase approach that reduces computational complexity and enhances robustness in 3D reconstruction without requiring extra annotations.

Findings

01

Achieves approximately 60% reduction in training time.

02

Provides more accurate 3D reconstructions on benchmarks.

03

Produces plausible skeletons for articulated objects.

Abstract

Reconstructing dynamic articulated objects from a singular monocular video is challenging, requiring joint estimation of shape, motion, and camera parameters from limited views. Current methods typically demand extensive computational resources and training time, and require additional human annotations such as predefined parametric models, camera poses, and key points, limiting their generalizability. We propose Synergistic Shape and Skeleton Optimization (S3O), a novel two-phase method that forgoes these prerequisites and efficiently learns parametric models including visible shapes and underlying skeletons. Conventional strategies typically learn all parameters simultaneously, leading to interdependencies where a single incorrect prediction can result in significant errors. In contrast, S3O adopts a phased approach: it first focuses on learning coarse parametric models, then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haoz19/limr
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Human Pose and Action Recognition · Hand Gesture Recognition Systems