Cognitive architecture aided by working-memory for self-supervised   multi-modal humans recognition

Jonas Gonzalez-Billandon; Giulia Belgiovine; Alessandra Sciutti,; Giulio Sandini; Francesco Rea

arXiv:2103.09072·cs.RO·March 17, 2021·1 cites

Cognitive architecture aided by working-memory for self-supervised multi-modal humans recognition

Jonas Gonzalez-Billandon, Giulia Belgiovine, Alessandra Sciutti,, Giulio Sandini, Francesco Rea

PDF

Open Access

TL;DR

This paper presents a cognitive architecture that combines perceptual processes with working memory to improve self-supervised multi-modal human recognition in robots, enhancing their autonomy in dynamic environments.

Contribution

It introduces a novel cognitive architecture integrating working memory with perception to enable autonomous, self-supervised learning for human recognition in robots.

Findings

01

Architecture effectively organizes sensory data for recognition.

02

Improves robot autonomy in dynamic, real-world scenarios.

03

Demonstrates promising results in multi-modal human recognition.

Abstract

The ability to recognize human partners is an important social skill to build personalized and long-term human-robot interactions, especially in scenarios like education, care-giving, and rehabilitation. Faces and voices constitute two important sources of information to enable artificial systems to reliably recognize individuals. Deep learning networks have achieved state-of-the-art results and demonstrated to be suitable tools to address such a task. However, when those networks are applied to different and unprecedented scenarios not included in the training set, they can suffer a drop in performance. For example, with robotic platforms in ever-changing and realistic environments, where always new sensory evidence is acquired, the performance of those models degrades. One solution is to make robots learn from their first-hand sensory data with self-supervision. This allows coping…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Face recognition and analysis · Advanced Image and Video Retrieval Techniques