Loading paper
Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval | Tomesphere