A vector quantized masked autoencoder for audiovisual speech emotion recognition