Loading paper
Response Wide Shut: Surprising Observations in Basic Vision Language Model Capabilities | Tomesphere