Loading paper
VisualBERT: A Simple and Performant Baseline for Vision and Language | Tomesphere