Enabling robots to follow abstract instructions and complete complex dynamic tasks
Ruaridh Mon-Williams, Gen Li, Ran Long, Wenqian Du, Chris Lucas

TL;DR
This paper introduces a novel robotic framework that combines large language models, a curated knowledge base, and integrated feedback to enable robots to interpret high-level instructions and perform complex, dynamic tasks in unpredictable environments.
Contribution
The paper presents a new integrated approach using GPT-4, a curated database, and feedback mechanisms to improve robotic task execution from abstract instructions.
Findings
Successfully demonstrated in coffee making and plate decoration tasks
Enhanced robot adaptability to noise and disturbances during execution
Open-source implementation available for further research
Abstract
Completing complex tasks in unpredictable settings like home kitchens challenges robotic systems. These challenges include interpreting high-level human commands, such as "make me a hot beverage" and performing actions like pouring a precise amount of water into a moving mug. To address these challenges, we present a novel framework that combines Large Language Models (LLMs), a curated Knowledge Base, and Integrated Force and Visual Feedback (IFVF). Our approach interprets abstract instructions, performs long-horizon tasks, and handles various uncertainties. It utilises GPT-4 to analyse the user's query and surroundings, then generates code that accesses a curated database of functions during execution. It translates abstract instructions into actionable steps. Each step involves generating custom code by employing retrieval-augmented generalisation to pull IFVF-relevant examples from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · AI-based Problem Solving and Planning
MethodsAttention Is All You Need · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer
