ACRNet: Attention Cube Regression Network for Multi-view Real-time 3D Human Pose Estimation in Telemedicine
Boce Hu, Chenfei Zhu, Xupeng Ai, Sunil K. Agrawal

TL;DR
ACRNet is a real-time multi-view 3D human pose estimation network using attention cubes, demonstrating high accuracy and speed suitable for telemedicine and rehabilitation scenarios.
Contribution
This paper introduces ACRNet, a novel attention cube regression network that improves real-time 3D human pose estimation accuracy and efficiency in telemedicine applications.
Findings
Outperforms state-of-the-art methods on ITOP dataset
Validates effectiveness on a new rehabilitation dataset
Achieves high accuracy and real-time performance
Abstract
Human pose estimation (HPE) for 3D skeleton reconstruction in telemedicine has long received attention. Although the development of deep learning has made HPE methods in telemedicine simpler and easier to use, addressing low accuracy and high latency remains a big challenge. In this paper, we propose a novel multi-view Attention Cube Regression Network (ACRNet), which regresses the 3D position of joints in real time by aggregating informative attention points on each cube surface. More specially, a cube whose each surface contains uniformly distributed attention points with specific coordinate values is first created to wrap the target from the main view. Then, our network regresses the 3D position of each joint by summing and averaging the coordinates of attention points on each surface after being weighted. To verify our method, we first tested ACRNet on the open-source ITOP dataset;…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Hand Gesture Recognition Systems · Stroke Rehabilitation and Recovery
