Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human   Mesh Recovery

Kai Jia; Hongwen Zhang; Liang An; Yebin Liu

arXiv:2301.06020·cs.CV·December 19, 2023

Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human Mesh Recovery

Kai Jia, Hongwen Zhang, Liang An, Yebin Liu

PDF

Open Access 1 Video

TL;DR

This paper introduces Pixel-aligned Feedback Fusion (PaFF), an iterative multi-view human mesh recovery method that improves accuracy by alternately extracting and fusing pixel-aligned features based on current mesh estimates.

Contribution

The paper proposes a novel iterative regression framework, PaFF, that enhances multi-view human mesh recovery by feedback-based feature extraction and fusion, disentangling global orientation from mesh parameters.

Findings

01

Achieves 33.02 MPJPE on Human3.6M dataset.

02

Significantly outperforms previous methods with over 29% improvement.

03

Validated through comprehensive ablation experiments.

Abstract

Regression-based methods have shown high efficiency and effectiveness for multi-view human mesh recovery. The key components of a typical regressor lie in the feature extraction of input views and the fusion of multi-view features. In this paper, we present Pixel-aligned Feedback Fusion (PaFF) for accurate yet efficient human mesh recovery from multi-view images. PaFF is an iterative regression framework that performs feature extraction and fusion alternately. At each iteration, PaFF extracts pixel-aligned feedback features from each input view according to the reprojection of the current estimation and fuses them together with respect to each vertex of the downsampled mesh. In this way, our regressor can not only perceive the misalignment status of each view from the feedback features but also correct the mesh parameters more effectively based on the feature fusion on mesh vertices.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human Mesh Recovery· underline

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques · Advanced Vision and Imaging