MEBOW: Monocular Estimation of Body Orientation In the Wild
Chenyan Wu, Yukun Chen, Jiajia Luo, Che-Chun Su, Anuja Dawane,, Bikramjot Hanzra, Zhuo Deng, Bilan Liu, James Wang, Cheng-Hao Kuo

TL;DR
MEBOW introduces a large-scale in-the-wild dataset for body orientation estimation, significantly improving model robustness and enabling a novel joint training approach that enhances 3-D human pose estimation from monocular images.
Contribution
The paper presents COCO-MEBOW, a new dataset with 130K body orientation labels, and a triple-source training method that advances monocular 3-D human pose estimation.
Findings
Dataset improves orientation estimation accuracy and robustness.
Joint training with body orientation enhances 3-D pose estimation performance.
Model outperforms state-of-the-art dual-source methods.
Abstract
Body orientation estimation provides crucial visual cues in many applications, including robotics and autonomous driving. It is particularly desirable when 3-D pose estimation is difficult to infer due to poor image resolution, occlusion or indistinguishable body parts. We present COCO-MEBOW (Monocular Estimation of Body Orientation in the Wild), a new large-scale dataset for orientation estimation from a single in-the-wild image. The body-orientation labels for around 130K human bodies within 55K images from the COCO dataset have been collected using an efficient and high-precision annotation pipeline. We also validated the benefits of the dataset. First, we show that our dataset can substantially improve the performance and the robustness of a human body orientation estimation model, the development of which was previously limited by the scale and diversity of the available training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
MEBOW: Monocular Estimation of Body Orientation in the Wild· youtube
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
