GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning