Loading paper
A Simple Yet Effective Method for Video Temporal Grounding with Cross-Modality Attention | Tomesphere