GIT: A Generative Image-to-text Transformer for Vision and Language