Evaluating the encoding competence of visual language models using uncommon actions