Learning a Thousand Tasks in a Day
Kamil Dreczkowski, Pietro Vitiello, Vitalis Vosylius, Edward Johns

TL;DR
This paper introduces a decomposition and retrieval-based imitation learning method called MT3, enabling robots to learn 1,000 tasks efficiently from minimal demonstrations, significantly surpassing traditional methods in data efficiency and generalization.
Contribution
The paper presents a novel decomposition and retrieval approach for imitation learning, achieving high data efficiency and scalability for robot manipulation tasks from very few demonstrations.
Findings
Decomposition improves data efficiency by an order of magnitude in few-demonstration regimes.
Retrieval-based methods outperform behavioral cloning in alignment and interaction phases.
MT3 can learn 1,000 tasks in under 24 hours of human demonstration time.
Abstract
Humans are remarkably efficient at learning tasks from demonstrations, but today's imitation learning methods for robot manipulation often require hundreds or thousands of demonstrations per task. We investigate two fundamental priors for improving learning efficiency: decomposing manipulation trajectories into sequential alignment and interaction phases, and retrieval-based generalisation. Through 3,450 real-world rollouts, we systematically study this decomposition. We compare different design choices for the alignment and interaction phases, and examine generalisation and scaling trends relative to today's dominant paradigm of behavioural cloning with a single-phase monolithic policy. In the few-demonstrations-per-task regime (<10 demonstrations), decomposition achieves an order of magnitude improvement in data efficiency over single-phase learning, with retrieval consistently…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · Social Robot Interaction and HRI
