Loading paper
Listening without Looking: Modality Bias in Audio-Visual Captioning | Tomesphere