ARCADE: Scalable Demonstration Collection and Generation via Augmented Reality for Imitation Learning
Yue Yang, Bryce Ikeda, Gedas Bertasius, Daniel Szafir

TL;DR
ARCADE introduces an augmented reality framework that simplifies demonstration collection and automatically generates synthetic data, enabling scalable robot imitation learning with minimal user effort across various tasks.
Contribution
The paper presents ARCADE, a novel AR-based system that streamlines demonstration collection and synthesizes additional data, significantly enhancing the scalability of robot imitation learning.
Findings
Successfully trained policies on multiple tasks using ARCADE
Achieved 80% success rate on household pouring task
Reduced demonstration collection time and effort
Abstract
Robot Imitation Learning (IL) is a crucial technique in robot learning, where agents learn by mimicking human demonstrations. However, IL encounters scalability challenges stemming from both non-user-friendly demonstration collection methods and the extensive time required to amass a sufficient number of demonstrations for effective training. In response, we introduce the Augmented Reality for Collection and generAtion of DEmonstrations (ARCADE) framework, designed to scale up demonstration collection for robot manipulation tasks. Our framework combines two key capabilities: 1) it leverages AR to make demonstration collection as simple as users performing daily tasks using their hands, and 2) it enables the automatic generation of additional synthetic demonstrations from a single human-derived demonstration, significantly reducing user effort and time. We assess ARCADE's performance on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Artificial Intelligence in Games · Multimodal Machine Learning Applications
