Toward Efficient Generalization in 3D Human Pose Estimation via a   Canonical Domain Approach

Hoosang Lee; Jeha Ryu

arXiv:2501.16146·cs.CV·January 28, 2025

Toward Efficient Generalization in 3D Human Pose Estimation via a Canonical Domain Approach

Hoosang Lee, Jeha Ryu

PDF

Open Access

TL;DR

This paper introduces a canonical domain approach for 3D human pose estimation that reduces domain gap issues, enabling models to generalize better across datasets without extra fine-tuning.

Contribution

The paper proposes a novel canonicalization process that maps source and target domains into a unified domain, improving cross-dataset generalization in 3D human pose estimation.

Findings

01

Significant improvement in cross-dataset generalization performance

02

Method reduces need for fine-tuning on new datasets

03

Effective across various lifting networks and datasets

Abstract

Recent advancements in deep learning methods have significantly improved the performance of 3D Human Pose Estimation (HPE). However, performance degradation caused by domain gaps between source and target domains remains a major challenge to generalization, necessitating extensive data augmentation and/or fine-tuning for each specific target domain. To address this issue more efficiently, we propose a novel canonical domain approach that maps both the source and target domains into a unified canonical domain, alleviating the need for additional fine-tuning in the target domain. To construct the canonical domain, we introduce a canonicalization process to generate a novel canonical 2D-3D pose mapping that ensures 2D-3D pose consistency and simplifies 2D-3D pose patterns, enabling more efficient training of lifting networks. The canonicalization of both domains is achieved through the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Gait Recognition and Analysis