Cognitive architecture aided by working-memory for self-supervised multi-modal humans recognition
Jonas Gonzalez-Billandon, Giulia Belgiovine, Alessandra Sciutti,, Giulio Sandini, Francesco Rea

TL;DR
This paper presents a cognitive architecture that combines perceptual processes with working memory to improve self-supervised multi-modal human recognition in robots, enhancing their autonomy in dynamic environments.
Contribution
It introduces a novel cognitive architecture integrating working memory with perception to enable autonomous, self-supervised learning for human recognition in robots.
Findings
Architecture effectively organizes sensory data for recognition.
Improves robot autonomy in dynamic, real-world scenarios.
Demonstrates promising results in multi-modal human recognition.
Abstract
The ability to recognize human partners is an important social skill to build personalized and long-term human-robot interactions, especially in scenarios like education, care-giving, and rehabilitation. Faces and voices constitute two important sources of information to enable artificial systems to reliably recognize individuals. Deep learning networks have achieved state-of-the-art results and demonstrated to be suitable tools to address such a task. However, when those networks are applied to different and unprecedented scenarios not included in the training set, they can suffer a drop in performance. For example, with robotic platforms in ever-changing and realistic environments, where always new sensory evidence is acquired, the performance of those models degrades. One solution is to make robots learn from their first-hand sensory data with self-supervision. This allows coping…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Face recognition and analysis · Advanced Image and Video Retrieval Techniques
