Loading paper
Rebalanced Vision-Language Retrieval Considering Structure-Aware Distillation | Tomesphere