Object-aware Video-language Pre-training for Retrieval