Loading paper
Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding | Tomesphere