Rethinking Deconvolution for 2D Human Pose Estimation Light yet Accurate Model for Real-time Edge Computing
Masayuki Yamazaki, Eigo Mori

TL;DR
This paper introduces a lightweight, efficient 2D human pose estimation model optimized for real-time edge computing, achieving high accuracy with significantly reduced computational cost and high frame rates on embedded devices.
Contribution
The study presents a novel downsized encoder-decoder model with optimized deconvolution layers, combined with model distillation and quantization techniques, enabling real-time, accurate pose estimation on low-power devices.
Findings
Achieved 94.5% SOTA accuracy with only 3.8% of the computational cost.
Operates at ~60 fps on NVIDIA Jetson AGX Xavier.
Achieves 70.0 AP with FP16 precision on COCO dataset.
Abstract
In this study, we present a pragmatic lightweight pose estimation model. Our model can achieve real-time predictions using low-power embedded devices. This system was found to be very accurate and achieved a 94.5% accuracy of SOTA HRNet 256x192 using a computational cost of only 3.8% on COCO test dataset. Our model adopts an encoder-decoder architecture and is carefully downsized to improve its efficiency. We especially focused on optimizing the deconvolution layers and observed that the channel reduction of the deconvolution layers contributes significantly to reducing computational resource consumption without degrading the accuracy of this system. We also incorporated recent model agnostic techniques such as DarkPose and distillation training to maximize the efficiency of our model. Furthermore, we applied model quantization to exploit multi/mixed precision features. Our FP16'ed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
MethodsConvolution · *Communicated@Fast*How Do I Communicate to Expedia? · Residual Connection · Batch Normalization · HRNet
