Loading paper
VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection | Tomesphere