Loading paper
LFS: Learnable Frame Selector for Event-Aware and Temporally Diverse Video Captioning | Tomesphere