TL;DR
This paper introduces BMP, a single-stage method for multi-person 3D body mesh estimation that improves efficiency and accuracy by representing persons as points in space and directly predicting their meshes.
Contribution
BMP is the first single-stage model that concurrently localizes person points and estimates body meshes, reducing complexity and enhancing performance in multi-person scenes.
Findings
Achieves state-of-the-art efficiency on benchmarks.
Demonstrates high accuracy in complex scenes with occlusions.
Outperforms existing multi-stage methods in speed and precision.
Abstract
We consider the challenging multi-person 3D body mesh estimation task in this work. Existing methods are mostly two-stage based--one stage for person localization and the other stage for individual body mesh estimation, leading to redundant pipelines with high computation cost and degraded performance for complex scenes (e.g., occluded person instances). In this work, we present a single-stage model, Body Meshes as Points (BMP), to simplify the pipeline and lift both efficiency and performance. In particular, BMP adopts a new method that represents multiple person instances as points in the spatial-depth space where each point is associated with one body mesh. Hinging on such representations, BMP can directly predict body meshes for multiple persons in a single stage by concurrently localizing person instance points and estimating the corresponding body meshes. To better reason about…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
