Toward a Dialogue System Using a Large Language Model to Recognize User Emotions with a Camera
Hiroki Tanioka, Tetsushi Ueta, Masahiko Sano

TL;DR
This paper explores integrating facial expression-based emotion recognition into large language model dialogue systems to enable AI agents to adapt their responses according to user emotions, enhancing multimodal interaction.
Contribution
It introduces a method for LLM-based AI agents to recognize user emotions from facial expressions via camera and incorporate this information into dialogue prompts.
Findings
AI agents can respond appropriately to emotions like Happy and Angry.
Emotion recognition from facial expressions improves dialogue relevance.
Multimodal emotion-aware dialogue systems are feasible with current LLMs.
Abstract
The performance of ChatGPT\copyright{} and other LLMs has improved tremendously, and in online environments, they are increasingly likely to be used in a wide variety of situations, such as ChatBot on web pages, call center operations using voice interaction, and dialogue functions using agents. In the offline environment, multimodal dialogue functions are also being realized, such as guidance by Artificial Intelligence agents (AI agents) using tablet terminals and dialogue systems in the form of LLMs mounted on robots. In this multimodal dialogue, mutual emotion recognition between the AI and the user will become important. So far, there have been methods for expressing emotions on the part of the AI agent or for recognizing them using textual or voice information of the user's utterances, but methods for AI agents to recognize emotions from the user's facial expressions have not been…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Speech and dialogue systems
