Loading paper
The ART of Composition: Attention-Regularized Training for Compositional Visual Grounding | Tomesphere