Minimal Self in Humanoid Robot "Alter3" Driven by Large Language Model
Takahide Yoshida, Suzune Baba, Atsushi Masumori, Takashi Ikegami

TL;DR
This paper presents Alter3, a humanoid robot driven by GPT-4 that can generate spontaneous movements and potentially develop a minimal sense of self through self-recognition tests.
Contribution
It demonstrates how large language models can be integrated with humanoid robots to enable autonomous actions and explore self-awareness.
Findings
Alter3 can perform tasks based on linguistic instructions.
Mirror self-recognition tests indicate potential for a sense of agency.
Language models can contribute to robot self-awareness.
Abstract
This paper introduces Alter3, a humanoid robot that demonstrates spontaneous motion generation through the integration of GPT-4, Large Language Model (LLM). This overcomes challenges in applying language models to direct robot control. By translating linguistic descriptions into actions, Alter3 can autonomously perform various tasks. The key aspect of humanoid robots is their ability to mimic human movement and emotions, allowing them to leverage human knowledge from language models. This raises the question of whether Alter3+GPT-4 can develop a "minimal self" with a sense of agency and ownership. This paper introduces mirror self-recognition and rubber hand illusion tests to assess Alter3's potential for a sense of self. The research suggests that even disembodied language models can develop agency when coupled with a physical robotic platform.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Automated Systems
