Gaze-Guided Task Decomposition for Imitation Learning in Robotic Manipulation

Ryo Takizawa; Yoshiyuki Ohmura; Yasuo Kuniyoshi

arXiv:2501.15071 · cs.RO · February 28, 2025

TL;DR

This paper introduces a gaze-based method for decomposing robotic manipulation tasks into sub-tasks, improving imitation learning by leveraging human gaze transitions during demonstrations.

Contribution

It presents a simple, robust gaze transition-based task decomposition method that ensures consistent sub-task segmentation across demonstrations in robotic manipulation.

Findings

01 The method is robust across different hyperparameter settings.

02 It ensures consistent task decomposition across demonstrations.

03 It is applicable to various robotic systems.

Abstract

In imitation learning for robotic manipulation, decomposing object manipulation tasks into sub-tasks enables the reuse of learned skills and the combination of learned behaviors to perform novel tasks, rather than simply replicating demonstrated motions. Human gaze is closely linked to hand movements during object manipulation. We hypothesize that an imitating agent's gaze control, fixating on specific landmarks and transitioning between them, simultaneously segments demonstrated manipulations into sub-tasks. This study proposes a simple yet robust task decomposition method based on gaze transitions. Demonstrations are collected through teleoperation, a common modality in robotic manipulation, during which a human operator's gaze is measured and used for task decomposition as a substitute for an imitating agent's gaze. Our approach ensures consistent task decomposition across all…
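To make the idea of segmenting a demonstration at gaze transitions concrete, here is a minimal Python sketch. It is not the authors' implementation: the function name, the normalized 2D gaze coordinates, the jump threshold, and the minimum segment length are all assumptions chosen for illustration. It marks a boundary wherever the gaze point jumps by more than a threshold between consecutive samples, and keeps only the stretches long enough to count as a fixation.

```python
import math


def segment_by_gaze_transitions(gaze, jump_threshold=0.2, min_len=3):
    """Split a gaze trajectory into fixation segments (hypothetical helper).

    gaze: list of (x, y) gaze points, assumed normalized to [0, 1].
    jump_threshold: assumed displacement between consecutive samples
        above which the gaze is treated as transitioning to a new landmark.
    min_len: assumed minimum number of samples for a segment to be kept.
    Returns a list of (start, end) index pairs, end exclusive.
    """
    boundaries = [0]
    for i in range(1, len(gaze)):
        (x0, y0), (x1, y1) = gaze[i - 1], gaze[i]
        # A large jump between consecutive samples marks a gaze transition.
        if math.hypot(x1 - x0, y1 - y0) > jump_threshold:
            boundaries.append(i)
    boundaries.append(len(gaze))

    # Keep only stretches long enough to be a fixation; short stretches
    # are samples captured while the gaze was still moving.
    return [(s, e) for s, e in zip(boundaries, boundaries[1:]) if e - s >= min_len]


# Toy trajectory: fixate one landmark, pass through a midpoint, fixate another.
demo_gaze = [(0.1, 0.1)] * 5 + [(0.5, 0.5)] + [(0.9, 0.9)] * 4
print(segment_by_gaze_transitions(demo_gaze))
```

Each returned index pair would correspond to one candidate sub-task; the robot's recorded actions over the same indices could then be grouped per segment. Real gaze data is noisy, so a practical version would likely need smoothing before thresholding.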

Code & Models

Repositories

crumbyRobotics/GazeTaskDecomp (PyTorch, official)

Taxonomy

Topics: Hand Gesture Recognition Systems · Robot Manipulation and Learning · Gaze Tracking and Assistive Technology