Interpretability Framework for LLMs in Undergraduate Calculus

Sagnik Dakshit; Sushmita Sinha Roy

arXiv:2510.17910·cs.CY·October 22, 2025

Interpretability Framework for LLMs in Undergraduate Calculus

Sagnik Dakshit, Sushmita Sinha Roy

PDF

Open Access

TL;DR

This paper presents a new interpretability framework for analyzing LLM-generated solutions in undergraduate calculus, focusing on reasoning processes, stability, and conceptual correctness, to improve AI transparency in math education.

Contribution

It introduces a structured, quantitative approach combining reasoning flow extraction, semantic decomposition, and prompt ablation to diagnose LLM reasoning in mathematics education.

Findings

01

LLMs often produce fluent but conceptually flawed solutions

02

Reasoning patterns are sensitive to prompt phrasing and input variations

03

The framework enables fine-grained diagnosis of reasoning failures

Abstract

Large Language Models (LLMs) are increasingly being used in education, yet their correctness alone does not capture the quality, reliability, or pedagogical validity of their problem-solving behavior, especially in mathematics, where multistep logic, symbolic reasoning, and conceptual clarity are critical. Conventional evaluation methods largely focus on final answer accuracy and overlook the reasoning process. To address this gap, we introduce a novel interpretability framework for analyzing LLM-generated solutions using undergraduate calculus problems as a representative domain. Our approach combines reasoning flow extraction and decomposing solutions into semantically labeled operations and concepts with prompt ablation analysis to assess input salience and output stability. Using structured metrics such as reasoning complexity, phrase sensitivity, and robustness, we evaluated the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Educational Assessment and Pedagogy · Explainable Artificial Intelligence (XAI)