DogMo: A Large-Scale Multi-View RGB-D Dataset for 4D Canine Motion Recovery
Zan Wang, Siyu Chen, Luya Mo, Xinfeng Gao, Yuxin Shen, Lebin Ding, Wei Liang

TL;DR
DogMo is a comprehensive multi-view RGB-D dataset of canine movements, enabling improved 4D motion recovery research through diverse data and a novel optimization pipeline.
Contribution
We introduce DogMo, the first large-scale multi-view RGB-D dog motion dataset, and a three-stage optimization method for accurate 4D canine motion reconstruction.
Findings
DogMo contains 1.2k motion sequences from 10 breeds.
The proposed method achieves state-of-the-art motion recovery accuracy.
Benchmark settings facilitate systematic evaluation of monocular and multi-view approaches.
Abstract
We present DogMo, a large-scale multi-view RGB-D video dataset capturing diverse canine movements for the task of motion recovery from images. DogMo comprises 1.2k motion sequences collected from 10 unique dogs, offering rich variation in both motion and breed. It addresses key limitations of existing dog motion datasets, including the lack of multi-view and real 3D data, as well as limited scale and diversity. Leveraging DogMo, we establish four motion recovery benchmark settings that support systematic evaluation across monocular and multi-view, RGB and RGB-D inputs. To facilitate accurate motion recovery, we further introduce a three-stage, instance-specific optimization pipeline that fits the SMAL model to the motion sequences. Our method progressively refines body shape and pose through coarse alignment, dense correspondence supervision, and temporal regularization. Our dataset and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
