Delving Deeper: Hierarchical Visual Perception for Robust Video-Text Retrieval