EgoMimic: Scaling Imitation Learning via Egocentric Video

Simar Kareer; Dhruv Patel; Ryan Punamiya; Pranay Mathur; Shuo Cheng,; Chen Wang; Judy Hoffman; Danfei Xu

arXiv:2410.24221·cs.RO·November 1, 2024·3 cites

EgoMimic: Scaling Imitation Learning via Egocentric Video

Simar Kareer, Dhruv Patel, Ryan Punamiya, Pranay Mathur, Shuo Cheng,, Chen Wang, Judy Hoffman, Danfei Xu

PDF

Open Access 1 Repo

TL;DR

EgoMimic introduces a comprehensive framework that leverages egocentric human videos and 3D hand tracking to significantly improve manipulation tasks in robots, enabling better generalization and scaling of imitation learning.

Contribution

The paper presents a full-stack system combining data collection, alignment, and co-training that treats human and robot data equally for imitation learning, advancing beyond high-level intent extraction.

Findings

01

EgoMimic outperforms prior imitation methods on various manipulation tasks.

02

Adding 1 hour of human hand data yields more improvement than 1 hour of robot data.

03

The approach enables generalization to new scenes and tasks.

Abstract

The scale and diversity of demonstration data required for imitation learning is a significant challenge. We present EgoMimic, a full-stack framework which scales manipulation via human embodiment data, specifically egocentric human videos paired with 3D hand tracking. EgoMimic achieves this through: (1) a system to capture human embodiment data using the ergonomic Project Aria glasses, (2) a low-cost bimanual manipulator that minimizes the kinematic gap to human data, (3) cross-domain data alignment techniques, and (4) an imitation learning architecture that co-trains on human and robot data. Compared to prior works that only extract high-level intent from human videos, our approach treats human and robot data equally as embodied demonstration data and learns a unified policy from both data sources. EgoMimic achieves significant improvement on a diverse set of long-horizon, single-arm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SimarKareer/EgoMimic
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Games and Media · Educational Games and Gamification · Human Motion and Animation

MethodsAdaptive Richard's Curve Weighted Activation · Sparse Evolutionary Training