Loading paper
CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions | Tomesphere