Experimental Study on the Imitation of the Human Head-and-Eye Pose Using   the 3-DOF Agile Eye Parallel Robot with ROS and Mediapipe Framework

Amirmohammad Radmehr; Milad Asgari; Mehdi Tale Masouleh

arXiv:2111.00452·cs.RO·November 5, 2021

Experimental Study on the Imitation of the Human Head-and-Eye Pose Using the 3-DOF Agile Eye Parallel Robot with ROS and Mediapipe Framework

Amirmohammad Radmehr, Milad Asgari, Mehdi Tale Masouleh

PDF

Open Access

TL;DR

This study combines computer vision and robotic control to imitate human head and eye movements using a 3-DOF parallel robot, leveraging ROS, Mediapipe, and machine learning for pose estimation.

Contribution

It introduces a novel robotic system that mimics human head and eye movements with two methods for face pose estimation, integrating ROS and machine learning techniques.

Findings

01

Mediapipe provides high-fidelity face pose tracking.

02

Linear regression effectively estimates face pose angles.

03

The robotic system successfully imitates human head and eye movements.

Abstract

In this paper, a method to mimic a human face and eyes is proposed which can be regarded as a combination of computer vision techniques and neural network concepts. From a mechanical standpoint, a 3-DOF spherical parallel robot is used which imitates the human head movement. In what concerns eye movement, a 2-DOF mechanism is attached to the end-effector of the 3-DOF spherical parallel mechanism. In order to have robust and reliable results for the imitation, meaningful information should be extracted from the face mesh for obtaining the pose of a face, i.e., the roll, yaw, and pitch angles. To this end, two methods are proposed where each of them has its own pros and cons. The first method consists in resorting to the so-called Mediapipe library which is a machine learning solution for high-fidelity body pose tracking, introduced by Google. As the second method, a model is trained by a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaze Tracking and Assistive Technology