Loading paper
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment | Tomesphere