Human-like Controllable Image Captioning with Verb-specific Semantic Roles