Occluded Human Mesh Recovery

Rawal Khirodkar; Shashank Tripathi; Kris Kitani

arXiv:2203.13349·cs.CV·March 28, 2022

Occluded Human Mesh Recovery

Rawal Khirodkar, Shashank Tripathi, Kris Kitani

PDF

Open Access

TL;DR

This paper introduces OCHMR, a top-down approach for 3D human mesh recovery that effectively handles multi-person occlusion by incorporating spatial context through body-center heatmaps and contextual normalization.

Contribution

The paper proposes a novel contextual reasoning architecture with CoNorm blocks that improves multi-person mesh recovery under occlusion, applicable to existing top-down models.

Findings

01

Achieves state-of-the-art results on 3DPW, CrowdPose, and OCHuman datasets.

02

Significantly improves accuracy over baseline models in occluded scenarios.

03

Demonstrates robustness to severe multi-person occlusion.

Abstract

Top-down methods for monocular human mesh recovery have two stages: (1) detect human bounding boxes; (2) treat each bounding box as an independent single-human mesh recovery task. Unfortunately, the single-human assumption does not hold in images with multi-human occlusion and crowding. Consequently, top-down methods have difficulties in recovering accurate 3D human meshes under severe person-person occlusion. To address this, we present Occluded Human Mesh Recovery (OCHMR) - a novel top-down mesh recovery approach that incorporates image spatial context to overcome the limitations of the single-human assumption. The approach is conceptually simple and can be applied to any existing top-down architecture. Along with the input image, we condition the top-down model on spatial context from the image in the form of body-center heatmaps. To reason from the predicted body centermaps, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods · Human Pose and Action Recognition · Image Enhancement Techniques