Loading paper
Are we pretraining it right? Digging deeper into visio-linguistic pretraining | Tomesphere