Response Wide Shut? Surprising Observations in Basic Vision Language Model Capabilities