Loading paper
Why are Visually-Grounded Language Models Bad at Image Classification? | Tomesphere