Loading paper
Self-supervised Contrastive Learning for Audio-Visual Action Recognition | Tomesphere