Enabling a Social Robot to Process Social Cues to Detect when to Help a User
Jason R. Wilson, Phyo Thuta Aung, Isabelle Boucher

TL;DR
This paper presents a multimodal architecture enabling social robots to recognize when to assist users by analyzing social cues like eye gaze and language, aiming for real-time, context-independent assistance.
Contribution
It introduces a novel multimodal fusion approach using social cues for timely assistance, adaptable across various tasks and contexts.
Findings
Effective detection of user needs in a Lego task
Minimal task-specific dependencies in the architecture
Potential for improved user experience through social cue recognition
Abstract
It is important for socially assistive robots to be able to recognize when a user needs and wants help. Such robots need to be able to recognize human needs in a real-time manner so that they can provide timely assistance. We propose an architecture that uses social cues to determine when a robot should provide assistance. Based on a multimodal fusion approach upon eye gaze and language modalities, our architecture is trained and evaluated on data collected in a robot-assisted Lego building task. By focusing on social cues, our architecture has minimal dependencies on the specifics of a given task, enabling it to be applied in many different contexts. Enabling a social robot to recognize a user's needs through social cues can help it to adapt to user behaviors and preferences, which in turn will lead to improved user experiences.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Gaze Tracking and Assistive Technology · Multimodal Machine Learning Applications
