Speech-Guided Sequential Planning for Autonomous Navigation using Large   Language Model Meta AI 3 (Llama3)

Alkesh K. Srivastava; Philip Dames

arXiv:2407.09890·cs.RO·September 30, 2024

Speech-Guided Sequential Planning for Autonomous Navigation using Large Language Model Meta AI 3 (Llama3)

Alkesh K. Srivastava, Philip Dames

PDF

Open Access

TL;DR

This paper introduces a speech-guided sequential planning system for autonomous robot navigation that leverages Llama3 for natural language understanding and DRL-VO for control, demonstrated through simulation and real-world experiments.

Contribution

It presents a novel integration of Llama3 with ROS for natural language-based navigation planning and combines it with learning-based control for social space navigation.

Findings

01

Effective speech-guided navigation in simulation

02

Successful hardware trials with real robot

03

Potential for real-world social robotics applications

Abstract

In social robotics, a pivotal focus is enabling robots to engage with humans in a more natural and seamless manner. The emergence of advanced large language models (LLMs) such as Generative Pre-trained Transformers (GPTs) and autoregressive models like Large Language Model Meta AI (Llamas) has driven significant advancements in integrating natural language understanding capabilities into social robots. This paper presents a system for speech-guided sequential planning in autonomous navigation, utilizing Llama3 and the Robot Operating System~(ROS). The proposed system involves using Llama3 to interpret voice commands, extracting essential details through parsing, and decoding these commands into sequential actions for tasks. Such sequential planning is essential in various domains, particularly in the pickup and delivery of an object. Once a sequential navigation task is evaluated, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Speech and dialogue systems · Robotics and Automated Systems