NoReGeo: Non-Reasoning Geometry Benchmark

Irina Abdullaeva; Anton Vasiliuk; Elizaveta Goncharova; Temurbek Rahmatullaev; Zagorulko Ivan; Maxim Kurkin; Andrey Kuznetsov

arXiv:2601.10254·cs.AI·January 16, 2026

NoReGeo: Non-Reasoning Geometry Benchmark

Irina Abdullaeva, Anton Vasiliuk, Elizaveta Goncharova, Temurbek Rahmatullaev, Zagorulko Ivan, Maxim Kurkin, Andrey Kuznetsov

PDF

Open Access 1 Video

TL;DR

NoReGeo is a benchmark that evaluates large language models' innate geometric understanding without reasoning, revealing current models' limited ability to grasp geometric concepts directly.

Contribution

The paper introduces NoReGeo, a new benchmark for assessing intrinsic geometric understanding in LLMs, highlighting the gap in native geometric cognition.

Findings

01

State-of-the-art models achieve up to 65% accuracy on geometric tasks.

02

Geometric understanding does not improve significantly through fine-tuning.

03

Current models lack innate geometric comprehension, as shown by benchmark results.

Abstract

We present NoReGeo, a novel benchmark designed to evaluate the intrinsic geometric understanding of large language models (LLMs) without relying on reasoning or algebraic computation. Unlike existing benchmarks that primarily assess models' proficiency in reasoning-based geometry-where solutions are derived using algebraic methods-NoReGeo focuses on evaluating whether LLMs can inherently encode spatial relationships and recognize geometric properties directly. Our benchmark comprises 2,500 trivial geometric problems spanning 25 categories, each carefully crafted to be solvable purely through native geometric understanding, assuming known object locations. We assess a range of state-of-the-art models on NoReGeo, including frontier models like GPT-4, observing that even the most advanced systems achieve an overall maximum of 65% accuracy in binary classification tasks. Further, our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

NoReGeo: Non-Reasoning Geometry Benchmark· underline

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Graph Neural Networks · Topic Modeling