Delving Deep into Pixel Alignment Feature for Accurate Multi-view Human Mesh Recovery
Kai Jia, Hongwen Zhang, Liang An, Yebin Liu

TL;DR
This paper introduces Pixel-aligned Feedback Fusion (PaFF), an iterative multi-view human mesh recovery method that improves accuracy by alternately extracting and fusing pixel-aligned features based on current mesh estimates.
Contribution
The paper proposes a novel iterative regression framework, PaFF, that enhances multi-view human mesh recovery by feedback-based feature extraction and fusion, disentangling global orientation from mesh parameters.
Findings
Achieves 33.02 MPJPE on Human3.6M dataset.
Significantly outperforms previous methods with over 29% improvement.
Validated through comprehensive ablation experiments.
Abstract
Regression-based methods have shown high efficiency and effectiveness for multi-view human mesh recovery. The key components of a typical regressor lie in the feature extraction of input views and the fusion of multi-view features. In this paper, we present Pixel-aligned Feedback Fusion (PaFF) for accurate yet efficient human mesh recovery from multi-view images. PaFF is an iterative regression framework that performs feature extraction and fusion alternately. At each iteration, PaFF extracts pixel-aligned feedback features from each input view according to the reprojection of the current estimation and fuses them together with respect to each vertex of the downsampled mesh. In this way, our regressor can not only perceive the misalignment status of each view from the feedback features but also correct the mesh parameters more effectively based on the feature fusion on mesh vertices.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Image Enhancement Techniques · Advanced Vision and Imaging
