Loading paper
Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval | Tomesphere