Temporal aggregation of audio-visual modalities for emotion recognition