Loading paper
Can Vision-Language Models Solve the Shell Game? | Tomesphere