Loading paper
Text-Visual Prompting for Efficient 2D Temporal Video Grounding | Tomesphere