Interactive Language: Talking to Robots in Real Time

Corey Lynch; Ayzaan Wahid; Jonathan Tompson; Tianli Ding; James; Betker; Robert Baruch; Travis Armstrong; Pete Florence

arXiv:2210.06407·cs.RO·October 13, 2022·20 cites

Interactive Language: Talking to Robots in Real Time

Corey Lynch, Ayzaan Wahid, Jonathan Tompson, Tianli Ding, James, Betker, Robert Baruch, Travis Armstrong, Pete Florence

PDF

Open Access 1 Repo 5 Datasets

TL;DR

This paper introduces a real-time, natural language-interactable robot framework trained on a large dataset, achieving high success in executing diverse commands and guided by human language for complex tasks.

Contribution

It presents a new framework and open-source assets for building interactive robots capable of understanding and executing a wide range of natural language commands in real time.

Findings

01

93.5% success rate on 87,000 commands

02

Proficient execution of diverse visuo-linguo-motor skills

03

Guided by human language for complex, long-horizon tasks

Abstract

We present a framework for building interactive, real-time, natural language-instructable robots in the real world, and we open source related assets (dataset, environment, benchmark, and policies). Trained with behavioral cloning on a dataset of hundreds of thousands of language-annotated trajectories, a produced policy can proficiently execute an order of magnitude more commands than previous works: specifically we estimate a 93.5% success rate on a set of 87,000 unique natural language strings specifying raw end-to-end visuo-linguo-motor skills in the real world. We find that the same policy is capable of being guided by a human via real-time language to address a wide range of precise long-horizon rearrangement goals, e.g. "make a smiley face out of blocks". The dataset we release comprises nearly 600,000 language-labeled trajectories, an order of magnitude larger than prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/language-table
jax

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Reinforcement Learning in Robotics