Loading paper
Beyond Accuracy: Evaluating Grounded Visual Evidence in Thinking with Images | Tomesphere