THETA: Triangulated Hand-State Estimation for Teleoperation and Automation in Robotic Hand Control
Alex Huang, Akshay Karthik

TL;DR
THETA introduces a low-cost, triangulation-based method using three webcams and deep learning for real-time hand joint angle estimation, enabling affordable teleoperation of robotic hands.
Contribution
A novel triangulation approach with deep learning for cost-effective, real-time hand pose estimation in robotic teleoperation.
Findings
Achieved 97.18% accuracy in joint angle classification.
Demonstrated real-time hand control of a low-cost robotic hand.
Processed over 48,000 images for model training and validation.
Abstract
The teleoperation of robotic hands is limited by the high costs of depth cameras and sensor gloves, commonly used to estimate hand relative joint positions (XYZ). We present a novel, cost-effective approach using three webcams for triangulation-based tracking to approximate relative joint angles (theta) of human fingers. We also introduce a modified DexHand, a low-cost robotic hand from TheRobotStudio, to demonstrate THETA's real-time application. Data collection involved 40 distinct hand gestures using three 640x480p webcams arranged at 120-degree intervals, generating over 48,000 RGB images. Joint angles were manually determined by measuring midpoints of the MCP, PIP, and DIP finger joints. Captured RGB frames were processed using a DeepLabV3 segmentation model with a ResNet-50 backbone for multi-scale hand segmentation. The segmented images were then HSV-filtered and fed into THETA's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHand Gesture Recognition Systems · Robot Manipulation and Learning · Human Pose and Action Recognition
