Using General Large Language Models to Classify Mathematical Documents

Patrick D.F. Ion; Stephen M. Watt

arXiv:2406.10274·cs.IR·June 18, 2024

Using General Large Language Models to Classify Mathematical Documents

Patrick D.F. Ion, Stephen M. Watt

PDF

Open Access

TL;DR

This study explores the use of general large language models to classify mathematical documents, demonstrating promising accuracy and potential improvements over existing classifications based on titles and abstracts.

Contribution

It is the first to evaluate general LLMs for classifying mathematical papers using only titles and abstracts, highlighting their potential in mathematical literature navigation.

Findings

01

60% of classifications matched existing labels

02

Half of the matches included additional classifications

03

In 40% of cases, LLMs suggested better classifications

Abstract

In this article we report on an initial exploration to assess the viability of using the general large language models (LLMs), recently made public, to classify mathematical documents. Automated classification would be useful from the applied perspective of improving the navigation of the literature and the more open-ended goal of identifying relations among mathematical results. The Mathematical Subject Classification MSC 2020, from MathSciNet and zbMATH, is widely used and there is a significant corpus of ground truth material in the open literature. We have evaluated the classification of preprint articles from arXiv.org according to MSC 2020. The experiment used only the title and abstract alone -- not the entire paper. Since this was early in the use of chatbots and the development of their APIs, we report here on what was carried out by hand. Of course, the automation of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing