GRiT: A Generative Region-to-text Transformer for Object Understanding