Loading paper
Multimodal Transformer Distillation for Audio-Visual Synchronization | Tomesphere