Loading paper
Visual Question Answering based on Local-Scene-Aware Referring Expression Generation | Tomesphere