Compressing then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding