Loading paper
Probing Vision-Language Understanding through the Visual Entailment Task: promises and pitfalls | Tomesphere