Loading paper
Grid-VLP: Revisiting Grid Features for Vision-Language Pre-training | Tomesphere