Loading paper
Unifying Multimodal Transformer for Bi-directional Image and Text Generation | Tomesphere