CodeGlance: Understanding Code Reasoning Challenges in LLMs through Multi-Dimensional Feature Analysis

Yunkun Wang; Xuanhe Zhang; Junxiao Han; Chen Zhi; Shuiguang Deng

arXiv:2602.13962·cs.SE·February 17, 2026

CodeGlance: Understanding Code Reasoning Challenges in LLMs through Multi-Dimensional Feature Analysis

Yunkun Wang, Xuanhe Zhang, Junxiao Han, Chen Zhi, Shuiguang Deng

PDF

Open Access

TL;DR

CodeGlance introduces a multi-dimensional benchmark to evaluate LLMs on complex code reasoning tasks involving logic, APIs, and unseen functions, revealing key challenges and the impact of various augmentation strategies.

Contribution

This work presents a novel benchmark and systematic analysis of code reasoning challenges for LLMs across realistic scenarios, highlighting factors affecting performance and strategies for improvement.

Findings

01

Unseen function reasoning is significantly more difficult for smaller models.

02

Code complexity features like trace length and control flow impact reasoning difficulty.

03

Augmentation strategies have varying effectiveness depending on the reasoning challenge.

Abstract

In modern software development, developers frequently need to understand code behavior at a glance -- whether reviewing pull requests, debugging issues, or navigating unfamiliar codebases. This ability to reason about dynamic program behavior is fundamental to effective software engineering and increasingly supported by Large Language Models (LLMs). However, existing studies on code reasoning focus primarily on isolated code snippets, overlooking the complexity of real-world scenarios involving external API interactions and unfamiliar functions. This gap hinders our understanding of what truly makes code reasoning challenging for LLMs across diverse programming contexts. We present CodeGlance, a multi-dimensional benchmark investigating code reasoning challenges across three realistic scenarios: intrinsic logic reasoning, API interaction reasoning, and unseen function reasoning.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Software System Performance and Reliability