Masking Modalities for Cross-modal Video Retrieval