Enabling robots to follow abstract instructions and complete complex   dynamic tasks

Ruaridh Mon-Williams; Gen Li; Ran Long; Wenqian Du; Chris Lucas

arXiv:2406.11231·cs.RO·June 18, 2024·1 cites

Enabling robots to follow abstract instructions and complete complex dynamic tasks

Ruaridh Mon-Williams, Gen Li, Ran Long, Wenqian Du, Chris Lucas

PDF

Open Access

TL;DR

This paper introduces a novel robotic framework that combines large language models, a curated knowledge base, and integrated feedback to enable robots to interpret high-level instructions and perform complex, dynamic tasks in unpredictable environments.

Contribution

The paper presents a new integrated approach using GPT-4, a curated database, and feedback mechanisms to improve robotic task execution from abstract instructions.

Findings

01

Successfully demonstrated in coffee making and plate decoration tasks

02

Enhanced robot adaptability to noise and disturbances during execution

03

Open-source implementation available for further research

Abstract

Completing complex tasks in unpredictable settings like home kitchens challenges robotic systems. These challenges include interpreting high-level human commands, such as "make me a hot beverage" and performing actions like pouring a precise amount of water into a moving mug. To address these challenges, we present a novel framework that combines Large Language Models (LLMs), a curated Knowledge Base, and Integrated Force and Visual Feedback (IFVF). Our approach interprets abstract instructions, performs long-horizon tasks, and handles various uncertainties. It utilises GPT-4 to analyse the user's query and surroundings, then generates code that accesses a curated database of functions during execution. It translates abstract instructions into actionable steps. Each step involves generating custom code by employing retrieval-augmented generalisation to pull IFVF-relevant examples from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · AI-based Problem Solving and Planning

MethodsAttention Is All You Need · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer