RoMath: A Mathematical Reasoning Benchmark in Romanian

Adrian Cosma; Ana-Maria Bucur; Emilian Radoi

arXiv:2409.11074 · cs.CL · May 21, 2025


TL;DR

RoMath is a new Romanian mathematical reasoning benchmark designed to evaluate and improve AI models' understanding of informal mathematical language in a low-resource language, addressing the gap in multilingual mathematical AI resources.

Contribution

The paper introduces RoMath, the first Romanian mathematical reasoning benchmark with diverse subsets, and benchmarks open-weight models to highlight the need for resources in underrepresented languages.

Findings

01 Open-weight models show varied performance on RoMath

02 RoMath covers multiple difficulty levels and domains

03 Resources for low-resource languages are essential for multilingual AI

Abstract

Mathematics has long been conveyed through natural language, primarily for human understanding. With the rise of mechanized mathematics and proof assistants, there is a growing need to understand informal mathematical text, yet most existing benchmarks focus solely on English, overlooking other languages. This paper introduces RoMath, a Romanian mathematical reasoning benchmark suite comprising three subsets: Baccalaureate, Competitions and Synthetic, which cover a range of mathematical domains and difficulty levels, aiming to improve non-English language models and promote multilingual AI development. By focusing on Romanian, a low-resource language with unique linguistic features, RoMath addresses the limitations of Anglo-centric models and emphasizes the need for dedicated resources beyond simple automatic translation. We benchmark several open-weight language models, highlighting…


Code & Models

Repositories

cosmaadrian/romath (Official)

Datasets

cosmadrian/romath (dataset · 8 dl)


Taxonomy

Topics: Mathematics Education and Teaching Techniques
