Loading paper
BagFormer: Better Cross-Modal Retrieval via bag-wise interaction | Tomesphere