MERVIN: A Unified Framework for Multimodal Event Retrieval in Vietnamese News Videos