CodeGlance: Understanding Code Reasoning Challenges in LLMs through Multi-Dimensional Feature Analysis
Yunkun Wang, Xuanhe Zhang, Junxiao Han, Chen Zhi, Shuiguang Deng

TL;DR
CodeGlance introduces a multi-dimensional benchmark to evaluate LLMs on complex code reasoning tasks involving logic, APIs, and unseen functions, revealing key challenges and the impact of various augmentation strategies.
Contribution
This work presents a novel benchmark and systematic analysis of code reasoning challenges for LLMs across realistic scenarios, highlighting factors affecting performance and strategies for improvement.
Findings
Unseen function reasoning is significantly more difficult for smaller models.
Code complexity features like trace length and control flow impact reasoning difficulty.
Augmentation strategies have varying effectiveness depending on the reasoning challenge.
Abstract
In modern software development, developers frequently need to understand code behavior at a glance -- whether reviewing pull requests, debugging issues, or navigating unfamiliar codebases. This ability to reason about dynamic program behavior is fundamental to effective software engineering and increasingly supported by Large Language Models (LLMs). However, existing studies on code reasoning focus primarily on isolated code snippets, overlooking the complexity of real-world scenarios involving external API interactions and unfamiliar functions. This gap hinders our understanding of what truly makes code reasoning challenging for LLMs across diverse programming contexts. We present CodeGlance, a multi-dimensional benchmark investigating code reasoning challenges across three realistic scenarios: intrinsic logic reasoning, API interaction reasoning, and unseen function reasoning.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Software System Performance and Reliability
