UV R-CNN: Stable and Efficient Dense Human Pose Estimation

Wenhe Jia; Yilin Zhou; Xuhan Zhu; Mengjie Hu; Chun Liu; Qing Song

arXiv:2211.02337·cs.CV·November 7, 2022·1 cites

UV R-CNN: Stable and Efficient Dense Human Pose Estimation

Wenhe Jia, Yilin Zhou, Xuhan Zhu, Mengjie Hu, Chun Liu, Qing Song

PDF

Open Access

TL;DR

UV R-CNN introduces a novel architecture with a new loss function and balanced training strategy, significantly improving dense human pose estimation accuracy without auxiliary supervision.

Contribution

The paper proposes UV R-CNN with a Dense Points loss and balanced loss weighting, enhancing training stability and performance in dense human pose estimation.

Findings

01

Achieves 65.0% AP_{gps} on DensePose-COCO validation.

02

Outperforms previous methods in dense human pose estimation.

03

Stable training process without auxiliary supervision.

Abstract

Dense pose estimation is a dense 3D prediction task for instance-level human analysis, aiming to map human pixels from an RGB image to a 3D surface of the human body. Due to a large amount of surface point regression, the training process appears to be easy to collapse compared to other region-based human instance analyzing tasks. By analyzing the loss formulation of the existing dense pose estimation model, we introduce a novel point regression loss function, named Dense Points} loss to stable the training progress, and a new balanced loss weighting strategy to handle the multi-task losses. With the above novelties, we propose a brand new architecture, named UV R-CNN. Without auxiliary supervision and external knowledge from other tasks, UV R-CNN can handle many complicated issues in dense pose model training progress, achieving 65.0% $A P_{g p s}$ and 66.1% $A P_{g p s m}$ on the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Advanced Neural Network Applications · Hand Gesture Recognition Systems