Self-Supervised learning with cross-modal transformers for emotion recognition