Loading paper
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching | Tomesphere