Loading paper
Visual Referring Expression Recognition: What Do Systems Actually Learn? | Tomesphere