Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially-Aware Language Acquisition