Forgetful Large Language Models: Lessons Learned from Using LLMs in   Robot Programming

Juo-Tung Chen; Chien-Ming Huang

arXiv:2310.06646·cs.RO·October 11, 2023·1 cites

Forgetful Large Language Models: Lessons Learned from Using LLMs in Robot Programming

Juo-Tung Chen, Chien-Ming Huang

PDF

Open Access

TL;DR

This paper investigates common execution errors in LLM-generated robot code, identifies 'forgetfulness' as a key issue, and proposes prompt engineering strategies to improve reliability across multiple models.

Contribution

It introduces a categorization of errors in robot programming with LLMs, highlights 'forgetfulness' as a core problem, and offers prompt engineering tactics to mitigate execution errors.

Findings

01

Prompt engineering reduces execution errors in LLM-generated robot code.

02

Effectiveness demonstrated across ChatGPT, Bard, and LLaMA-2.

03

Calls for benchmarking LLMs in robot application development.

Abstract

Large language models offer new ways of empowering people to program robot applications-namely, code generation via prompting. However, the code generated by LLMs is susceptible to errors. This work reports a preliminary exploration that empirically characterizes common errors produced by LLMs in robot programming. We categorize these errors into two phases: interpretation and execution. In this work, we focus on errors in execution and observe that they are caused by LLMs being "forgetful" of key information provided in user prompts. Based on this observation, we propose prompt engineering tactics designed to reduce errors in execution. We then demonstrate the effectiveness of these tactics with three language models: ChatGPT, Bard, and LLaMA-2. Finally, we discuss lessons learned from using LLMs in robot programming and call for the benchmarking of LLM-powered end-user development of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Software Engineering Research