NaturalBench: Evaluating Vision-Language Models on Natural Adversarial Samples