EgoZero: Robot Learning from Smart Glasses

Vincent Liu; Ademi Adeniji; Haotian Zhan; Siddhant Haldar; Raunaq Bhirangi; Pieter Abbeel; Lerrel Pinto

arXiv:2505.20290·cs.RO·June 5, 2025

EgoZero: Robot Learning from Smart Glasses

Vincent Liu, Ademi Adeniji, Haotian Zhan, Siddhant Haldar, Raunaq Bhirangi, Pieter Abbeel, Lerrel Pinto

PDF

Open Access

TL;DR

EgoZero leverages egocentric human demonstrations captured with smart glasses to train robot manipulation policies without robot data, achieving high success rates in real-world tasks with minimal data.

Contribution

This work introduces EgoZero, a system that learns robot manipulation policies from human egocentric data without requiring robot demonstrations, enabling scalable and generalizable robot learning.

Findings

01

70% success rate over 7 tasks with 20 minutes of data per task

02

Zero-shot transfer of policies to a physical robot

03

Effective extraction of robot actions from human egocentric videos

Abstract

Despite recent progress in general purpose robotics, robot policies still lag far behind basic human capabilities in the real world. Humans interact constantly with the physical world, yet this rich data resource remains largely untapped in robot learning. We propose EgoZero, a minimal system that learns robust manipulation policies from human demonstrations captured with Project Aria smart glasses, $and zero robot data$ . EgoZero enables: (1) extraction of complete, robot-executable actions from in-the-wild, egocentric, human demonstrations, (2) compression of human visual observations into morphology-agnostic state representations, and (3) closed-loop policy learning that generalizes morphologically, spatially, and semantically. We deploy EgoZero policies on a gripper Franka Panda robot and demonstrate zero-shot transfer with 70% success rate over 7 manipulation tasks and only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Multimodal Machine Learning Applications

MethodsAdaptive Richard's Curve Weighted Activation