Loading paper
TxT: Crossmodal End-to-End Learning with Transformers | Tomesphere