Speech-Guided Sequential Planning for Autonomous Navigation using Large Language Model Meta AI 3 (Llama3)
Alkesh K. Srivastava, Philip Dames

TL;DR
This paper introduces a speech-guided sequential planning system for autonomous robot navigation that leverages Llama3 for natural language understanding and DRL-VO for control, demonstrated through simulation and real-world experiments.
Contribution
It presents a novel integration of Llama3 with ROS for natural language-based navigation planning and combines it with learning-based control for social space navigation.
Findings
Effective speech-guided navigation in simulation
Successful hardware trials with real robot
Potential for real-world social robotics applications
Abstract
In social robotics, a pivotal focus is enabling robots to engage with humans in a more natural and seamless manner. The emergence of advanced large language models (LLMs) such as Generative Pre-trained Transformers (GPTs) and autoregressive models like Large Language Model Meta AI (Llamas) has driven significant advancements in integrating natural language understanding capabilities into social robots. This paper presents a system for speech-guided sequential planning in autonomous navigation, utilizing Llama3 and the Robot Operating System~(ROS). The proposed system involves using Llama3 to interpret voice commands, extracting essential details through parsing, and decoding these commands into sequential actions for tasks. Such sequential planning is essential in various domains, particularly in the pickup and delivery of an object. Once a sequential navigation task is evaluated, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems · Robotics and Automated Systems
