Loading paper
If you can describe it, they can see it: Cross-Modal Learning of Visual Concepts from Textual Descriptions | Tomesphere