Loading paper
From Perception to Action: An Interactive Benchmark for Vision Reasoning | Tomesphere