Loading paper
Multi-Scale Temporal Difference Transformer for Video-Text Retrieval | Tomesphere