Loading paper
LVD-2M: A Long-take Video Dataset with Temporally Dense Captions | Tomesphere