Loading paper
Local-Global Video-Text Interactions for Temporal Grounding | Tomesphere