Loading paper
Contra4: Evaluating Contrastive Cross-Modal Reasoning in Audio, Video, Image, and 3D | Tomesphere