Multi-initialization Optimization Network for Accurate 3D Human Pose and Shape Estimation
Zhiwei Liu, Xiangyu Zhu, Lu Yang, Xiang Yan, Ming Tang, Zhen Lei,, Guibo Zhu, Xuetao Feng, Yan Wang, Jinqiao Wang

TL;DR
This paper introduces a three-stage framework called Multi-Initialization Optimization Network (MION) that improves 3D human pose and shape estimation from monocular images by reducing ambiguity through multiple initializations and refinement.
Contribution
The paper proposes a novel multi-initialization optimization framework with a mesh refinement transformer and a consistency estimation network for more accurate 3D human reconstruction.
Findings
Outperforms existing 3D mesh methods on benchmarks
Reduces ambiguity in 3D pose estimation
Effective multi-stage refinement process
Abstract
3D human pose and shape recovery from a monocular RGB image is a challenging task. Existing learning based methods highly depend on weak supervision signals, e.g. 2D and 3D joint location, due to the lack of in-the-wild paired 3D supervision. However, considering the 2D-to-3D ambiguities existed in these weak supervision labels, the network is easy to get stuck in local optima when trained with such labels. In this paper, we reduce the ambituity by optimizing multiple initializations. Specifically, we propose a three-stage framework named Multi-Initialization Optimization Network (MION). In the first stage, we strategically select different coarse 3D reconstruction candidates which are compatible with the 2D keypoints of input sample. Each coarse reconstruction can be regarded as an initialization leads to one optimization branch. In the second stage, we design a mesh refinement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Infrared Thermography in Medicine
