Multimodal Convolutional Neural Networks for Matching Image and Sentence