Loading paper
cross-modal fusion techniques for utterance-level emotion recognition from text and speech | Tomesphere