EgoZero: Robot Learning from Smart Glasses
Vincent Liu, Ademi Adeniji, Haotian Zhan, Siddhant Haldar, Raunaq Bhirangi, Pieter Abbeel, Lerrel Pinto

TL;DR
EgoZero leverages egocentric human demonstrations captured with smart glasses to train robot manipulation policies without robot data, achieving high success rates in real-world tasks with minimal data.
Contribution
This work introduces EgoZero, a system that learns robot manipulation policies from human egocentric data without requiring robot demonstrations, enabling scalable and generalizable robot learning.
Findings
70% success rate over 7 tasks with 20 minutes of data per task
Zero-shot transfer of policies to a physical robot
Effective extraction of robot actions from human egocentric videos
Abstract
Despite recent progress in general purpose robotics, robot policies still lag far behind basic human capabilities in the real world. Humans interact constantly with the physical world, yet this rich data resource remains largely untapped in robot learning. We propose EgoZero, a minimal system that learns robust manipulation policies from human demonstrations captured with Project Aria smart glasses, . EgoZero enables: (1) extraction of complete, robot-executable actions from in-the-wild, egocentric, human demonstrations, (2) compression of human visual observations into morphology-agnostic state representations, and (3) closed-loop policy learning that generalizes morphologically, spatially, and semantically. We deploy EgoZero policies on a gripper Franka Panda robot and demonstrate zero-shot transfer with 70% success rate over 7 manipulation tasks and only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Multimodal Machine Learning Applications
MethodsAdaptive Richard's Curve Weighted Activation
