Loading paper
SAViR-T: Spatially Attentive Visual Reasoning with Transformers | Tomesphere