Loading paper
ConTra: (Con)text (Tra)nsformer for Cross-Modal Video Retrieval | Tomesphere