Loading paper
ViSIL: Unified Evaluation of Information Loss in Multimodal Video Captioning | Tomesphere