More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching