Loading paper
Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training | Tomesphere