Evaluating Large Language Models on Solved and Unsolved Problems in Graph Theory: Implications for Computing Education
Adithya Kulkarni, Mohna Chakraborty, Jay Bagga

TL;DR
This study evaluates how large language models assist in understanding and exploring graph theory problems, showing they excel at established concepts but are limited in generating new mathematical insights, with implications for computing education.
Contribution
It provides a detailed analysis of LLM performance on solved and unsolved graph theory problems using an authentic inquiry protocol, highlighting strengths and limitations.
Findings
Strong performance on solved problem with correct proofs
Generated plausible strategies for open problem without fabricating results
Acknowledged uncertainty, avoiding unsupported claims
Abstract
Large Language Models are increasingly used by students to explore advanced material in computer science, including graph theory. As these tools become integrated into undergraduate and graduate coursework, it is important to understand how reliably they support mathematically rigorous thinking. This study examines the performance of a LLM on two related graph theoretic problems: a solved problem concerning the gracefulness of line graphs and an open problem for which no solution is currently known. We use an eight stage evaluation protocol that reflects authentic mathematical inquiry, including interpretation, exploration, strategy formation, and proof construction. The model performed strongly on the solved problem, producing correct definitions, identifying relevant structures, recalling appropriate results without hallucination, and constructing a valid proof confirmed by a graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTeaching and Learning Programming · Mathematics Education and Teaching Techniques · Machine Learning in Materials Science
