Loading paper
FIOVA: A Multi-Annotator Benchmark for Human-Aligned Video Captioning | Tomesphere