Using Large Language Models to Study Mathematical Practice
William D'Alessandro

TL;DR
This paper uses large language models to analyze a large corpus of mathematics papers, aiming to understand mathematical explanations and practices, and evaluates the potential of AI tools in philosophy of mathematics research.
Contribution
It introduces a novel corpus analysis approach using LLMs to study mathematical explanations, providing new insights and evaluating AI's role in philosophy of mathematical practice.
Findings
Mathematicians frequently make claims about explanation.
Explanatory practices vary across mathematical subjects.
AI tools can effectively analyze large datasets of mathematical texts.
Abstract
The philosophy of mathematical practice (PMP) looks to evidence from working mathematics to help settle philosophical questions. One prominent program under the PMP banner is the study of explanation in mathematics, which aims to understand what sorts of proofs mathematicians consider explanatory and what role the pursuit of explanation plays in mathematical practice. In an effort to address worries about cherry-picked examples and file-drawer problems in PMP, a handful of authors have recently turned to corpus analysis methods as a promising alternative to small-scale case studies. This paper reports the results from such a corpus study facilitated by Google's Gemini 2.5 Pro, a model whose reasoning capabilities, advances in hallucination control and large context window allow for the accurate analysis of hundreds of pages of text per query. Based on a sample of 5000 mathematics papers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
