RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose
Tao Jiang, Peng Lu, Li Zhang, Ningsheng Ma, Rui Han, Chengqi Lyu,, Yining Li, Kai Chen

TL;DR
RTMPose is a high-performance, real-time multi-person pose estimation framework based on MMPose, achieving state-of-the-art accuracy and speed on various hardware, including mobile devices, to bridge the gap between research and industrial applications.
Contribution
The paper introduces RTMPose, a novel real-time multi-person pose estimation framework that balances high accuracy with low latency, optimized for diverse hardware including mobile devices.
Findings
RTMPose-m achieves 75.8% AP at over 90 FPS on CPU.
RTMPose-l attains 67.0% AP on COCO-WholeBody at 130+ FPS.
RTMPose-s runs at 70+ FPS on Snapdragon 865, outperforming existing open-source solutions.
Abstract
Recent studies on 2D pose estimation have achieved excellent performance on public benchmarks, yet its application in the industrial community still suffers from heavy model parameters and high latency. In order to bridge this gap, we empirically explore key factors in pose estimation including paradigm, model architecture, training strategy, and deployment, and present a high-performance real-time multi-person pose estimation framework, RTMPose, based on MMPose. Our RTMPose-m achieves 75.8% AP on COCO with 90+ FPS on an Intel i7-11700 CPU and 430+ FPS on an NVIDIA GTX 1660 Ti GPU, and RTMPose-l achieves 67.0% AP on COCO-WholeBody with 130+ FPS. To further evaluate RTMPose's capability in critical real-time applications, we also report the performance after deploying on the mobile device. Our RTMPose-s achieves 72.2% AP on COCO with 70+ FPS on a Snapdragon 865 chip, outperforming…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Robot Manipulation and Learning
