Loading paper
Knowledge Distillation for Efficient Audio-Visual Video Captioning | Tomesphere