iCub Knows Where You Look: Exploiting Social Cues for Interactive Object Detection Learning
Maria Lombardi, Elisa Maiettini, Vadim Tikhanoff, Lorenzo Natale

TL;DR
This paper presents a social cue-based approach enabling the iCub robot to learn new objects through natural human interaction, leveraging joint attention and social signals to improve interactive object detection in dynamic environments.
Contribution
It introduces a novel method combining social cues like gaze and speech to facilitate robot learning of objects during interactive sessions.
Findings
Robot successfully learns new objects via social interaction.
Social cues improve the robustness of object detection.
System operates effectively amidst distractors.
Abstract
Performing joint interaction requires constant mutual monitoring of own actions and their effects on the other's behaviour. Such an action-effect monitoring is boosted by social cues and might result in an increasing sense of agency. Joint actions and joint attention are strictly correlated and both of them contribute to the formation of a precise temporal coordination. In human-robot interaction, the robot's ability to establish joint attention with a human partner and exploit various social cues to react accordingly is a crucial step in creating communicative robots. Along the social component, an effective human-robot interaction can be seen as a new method to improve and make the robot's learning process more natural and robust for a given task. In this work we use different social skills, such as mutual gaze, gaze following, speech and human face recognition, to develop an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning
