Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe
Sandeep Singh Sengar, Abhishek Kumar, Owen Singh

TL;DR
This paper enhances human pose estimation using MediaPipe by improving accuracy and efficiency, enabling real-time applications across various fields like AR, sports, and healthcare, especially on mobile and embedded devices.
Contribution
Introduces novel modifications to MediaPipe that significantly boost pose estimation accuracy and computational speed in real-time scenarios.
Findings
Enhanced accuracy in dynamic and occluded scenarios
Significant speed improvements over traditional models
Successful deployment on mobile and embedded systems
Abstract
This study presents significant enhancements in human pose estimation using the MediaPipe framework. The research focuses on improving accuracy, computational efficiency, and real-time processing capabilities by comprehensively optimising the underlying algorithms. Novel modifications are introduced that substantially enhance pose estimation accuracy across challenging scenarios, such as dynamic movements and partial occlusions. The improved framework is benchmarked against traditional models, demonstrating considerable precision and computational speed gains. The advancements have wide-ranging applications in augmented reality, sports analytics, and healthcare, enabling more immersive experiences, refined performance analysis, and advanced patient monitoring. The study also explores the integration of these enhancements within mobile and embedded systems, addressing the need for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Gait Recognition and Analysis · Anomaly Detection Techniques and Applications
MethodsSparse Evolutionary Training · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
