Occluded Human Mesh Recovery
Rawal Khirodkar, Shashank Tripathi, Kris Kitani

TL;DR
This paper introduces OCHMR, a top-down approach for 3D human mesh recovery that effectively handles multi-person occlusion by incorporating spatial context through body-center heatmaps and contextual normalization.
Contribution
The paper proposes a novel contextual reasoning architecture with CoNorm blocks that improves multi-person mesh recovery under occlusion, applicable to existing top-down models.
Findings
Achieves state-of-the-art results on 3DPW, CrowdPose, and OCHuman datasets.
Significantly improves accuracy over baseline models in occluded scenarios.
Demonstrates robustness to severe multi-person occlusion.
Abstract
Top-down methods for monocular human mesh recovery have two stages: (1) detect human bounding boxes; (2) treat each bounding box as an independent single-human mesh recovery task. Unfortunately, the single-human assumption does not hold in images with multi-human occlusion and crowding. Consequently, top-down methods have difficulties in recovering accurate 3D human meshes under severe person-person occlusion. To address this, we present Occluded Human Mesh Recovery (OCHMR) - a novel top-down mesh recovery approach that incorporates image spatial context to overcome the limitations of the single-human assumption. The approach is conceptually simple and can be applied to any existing top-down architecture. Along with the input image, we condition the top-down model on spatial context from the image in the form of body-center heatmaps. To reason from the predicted body centermaps, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Image Enhancement Techniques
