Experimental Study on the Imitation of the Human Head-and-Eye Pose Using the 3-DOF Agile Eye Parallel Robot with ROS and Mediapipe Framework
Amirmohammad Radmehr, Milad Asgari, Mehdi Tale Masouleh

TL;DR
This study combines computer vision and robotic control to imitate human head and eye movements using a 3-DOF parallel robot, leveraging ROS, Mediapipe, and machine learning for pose estimation.
Contribution
It introduces a novel robotic system that mimics human head and eye movements with two methods for face pose estimation, integrating ROS and machine learning techniques.
Findings
Mediapipe provides high-fidelity face pose tracking.
Linear regression effectively estimates face pose angles.
The robotic system successfully imitates human head and eye movements.
Abstract
In this paper, a method to mimic a human face and eyes is proposed which can be regarded as a combination of computer vision techniques and neural network concepts. From a mechanical standpoint, a 3-DOF spherical parallel robot is used which imitates the human head movement. In what concerns eye movement, a 2-DOF mechanism is attached to the end-effector of the 3-DOF spherical parallel mechanism. In order to have robust and reliable results for the imitation, meaningful information should be extracted from the face mesh for obtaining the pose of a face, i.e., the roll, yaw, and pitch angles. To this end, two methods are proposed where each of them has its own pros and cons. The first method consists in resorting to the so-called Mediapipe library which is a machine learning solution for high-fidelity body pose tracking, introduced by Google. As the second method, a model is trained by a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaze Tracking and Assistive Technology
