A Closer Look at the Robustness of Vision-and-Language Pre-trained Models