GPT Models Meet Robotic Applications: Co-Speech Gesturing Chat System
Naoki Wake, Atsushi Kanehira, Kazuhiro Sasabuchi, Jun Takamatsu, Katsushi Ikeuchi

TL;DR
This paper presents a robotic chat system that combines large language models like GPT with co-speech gesture generation to create more responsive and visually engaging human-robot interactions.
Contribution
It introduces a novel integration of LLMs with gesture generation for robots, enhancing responsiveness and visual communication in robotic chat systems.
Findings
The system effectively generates contextually appropriate gestures.
Co-speech gestures add a visual channel to the LLM interface, intended to make interactions more engaging.
Open-source code is available for replication.
Abstract
This technical paper introduces a chatting robot system that utilizes recent advancements in large-scale language models (LLMs) such as GPT-3 and ChatGPT. The system is integrated with a co-speech gesture generation system, which selects appropriate gestures based on the conceptual meaning of speech. Our motivation is to explore ways of utilizing the recent progress in LLMs for practical robotic applications, which benefits the development of both chatbots and LLMs. Specifically, it enables the development of highly responsive chatbot systems by leveraging LLMs and adds visual effects to the user interface of LLMs as an additional value. The source code for the system is available on GitHub for our in-house robot (https://github.com/microsoft/LabanotationSuite/tree/master/MSRAbotChatSimulation) and for the Toyota HSR (https://github.com/microsoft/GPT-Enabled-HSR-CoSpeechGestures).
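The pipeline described in the abstract (an LLM produces the reply text, and a gesture is then selected from the conceptual meaning of that text) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the concept labels, the gesture names, the keyword cues, and the `generate_reply` stub are hypothetical and are not taken from the authors' repositories. In particular, the keyword matching is a simple stand-in for whatever concept estimator the actual system uses.

```python
from dataclasses import dataclass

# Hypothetical concept-to-gesture table; labels and gesture names are illustrative only.
GESTURES = {
    "greeting": "wave_hand",
    "agreement": "nod",
    "negation": "shake_head",
    "neutral": "idle_beat",
}

# Hypothetical keyword cues per concept (a stand-in for a real concept estimator).
CUES = {
    "greeting": {"hello", "hi", "welcome"},
    "agreement": {"yes", "sure", "certainly"},
    "negation": {"no", "sorry", "cannot"},
}


@dataclass
class RobotTurn:
    speech: str   # text the robot would speak
    gesture: str  # gesture label to play alongside the speech


def generate_reply(user_text: str) -> str:
    """Placeholder for the LLM call; returns canned text so the sketch runs offline."""
    if "hello" in user_text.lower():
        return "Hello! Nice to meet you."
    return "Sure, I can help with that."


def estimate_concept(speech: str) -> str:
    """Pick the concept whose cue words overlap most with the speech text."""
    words = {w.strip(".,!?").lower() for w in speech.split()}
    best, best_hits = "neutral", 0
    for concept, cues in CUES.items():
        hits = len(words & cues)
        if hits > best_hits:
            best, best_hits = concept, hits
    return best


def respond(user_text: str) -> RobotTurn:
    """Generate the reply text, then attach a gesture chosen from its concept."""
    speech = generate_reply(user_text)
    return RobotTurn(speech, GESTURES[estimate_concept(speech)])
```

Calling `respond("hello robot")` returns a `RobotTurn` pairing the reply text with a gesture label. In a real deployment the stub would be replaced by an actual LLM request, and the gesture label would be passed to the robot's motion controller.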
Taxonomy
Topics: AI in Service Interactions · Topic Modeling
