TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving
Vincenzo Colle, Mohamed Sana, Nicola Piovesan, Antonio De Domenico, Fadhel Ayed, Merouane Debbah

TL;DR
TeleMath is a new benchmark dataset designed to evaluate large language models' ability to solve mathematically intensive problems specifically in the telecommunications domain, highlighting current models' strengths and limitations.
Contribution
Introduces TeleMath, the first domain-specific benchmark for assessing LLM performance on telecommunications-related mathematical problems, with a comprehensive evaluation of various models.
Findings
Recent models excel in mathematical reasoning tasks.
General-purpose LLMs often underperform on domain-specific problems.
The dataset and evaluation tools are publicly available for future research.
Abstract
The increasing adoption of artificial intelligence in telecommunications has raised interest in the capability of Large Language Models (LLMs) to address domain-specific, mathematically intensive tasks. Although recent advancements have improved the performance of LLMs in general mathematical reasoning, their effectiveness within specialized domains, such as signal processing, network optimization, and performance analysis, remains largely unexplored. To address this gap, we introduce TeleMath, the first benchmark dataset specifically designed to evaluate LLM performance in solving mathematical problems with numerical solutions in the telecommunications domain. Comprising 500 question-answer (QnA) pairs, TeleMath covers a wide spectrum of topics in the telecommunications field. This paper outlines the proposed QnAs generation pipeline, starting from a selected seed of problems crafted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Advanced Graph Neural Networks
