AlgBench: To What Extent Do Large Reasoning Models Understand Algorithms?

Henan Sun; Kaichi Yu; Yuyao Wang; Bowen Liu; Xunkai Li; Rong-Hua Li; Nuo Chen; Jia Li

arXiv:2601.04996·cs.AI·January 12, 2026

AlgBench: To What Extent Do Large Reasoning Models Understand Algorithms?

Henan Sun, Kaichi Yu, Yuyao Wang, Bowen Liu, Xunkai Li, Rong-Hua Li, Nuo Chen, Jia Li

PDF

Open Access

TL;DR

This paper introduces AlgBench, a comprehensive benchmark for evaluating large reasoning models' understanding of algorithms, revealing significant performance gaps and strategic limitations in current models.

Contribution

We present AlgBench, a new expert-curated benchmark with over 3,000 problems across 27 algorithms, to assess and analyze the algorithmic reasoning capabilities of large models.

Findings

01

Models perform well on non-optimized tasks (up to 92%)

02

Accuracy drops to around 49% on globally optimized algorithms

03

Models exhibit strategic over-shifts, prematurely abandoning correct solutions

Abstract

Reasoning ability has become a central focus in the advancement of Large Reasoning Models (LRMs). Although notable progress has been achieved on several reasoning benchmarks such as MATH500 and LiveCodeBench, existing benchmarks for algorithmic reasoning remain limited, failing to answer a critical question: Do LRMs truly master algorithmic reasoning? To answer this question, we propose AlgBench, an expert-curated benchmark that evaluates LRMs under an algorithm-centric paradigm. AlgBench consists of over 3,000 original problems spanning 27 algorithms, constructed by ACM algorithmic experts and organized under a comprehensive taxonomy, including Euclidean-structured, non-Euclidean-structured, non-optimized, local-optimized, global-optimized, and heuristic-optimized categories. Empirical evaluations on leading LRMs (e.g., Gemini-3-Pro, DeepSeek-v3.2-Speciale and GPT-o3) reveal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Constraint Satisfaction and Optimization · Multimodal Machine Learning Applications