Loading paper
Multi-Scale Contrastive Learning for Video Temporal Grounding | Tomesphere