Loading paper
A Survey of Vision-Language Pre-Trained Models | Tomesphere