Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems
Chengwei Wei, Bin Wang, Jung-jae Kim, Nancy F. Chen

TL;DR
This paper introduces Spoken-MQA, a benchmark for evaluating speech-based models' mathematical reasoning, revealing current models' strengths in contextual reasoning but weaknesses in direct arithmetic and verbalized expressions.
Contribution
It presents the first comprehensive benchmark for speech-based mathematical reasoning, assessing both cascade and end-to-end speech LLMs across diverse math problems.
Findings
Speech LLMs perform well on contextual reasoning with basic arithmetic.
Models struggle with direct arithmetic problems.
Current speech LLMs have difficulty interpreting verbalized math expressions.
Abstract
Recent advances in large language models (LLMs) and multimodal LLMs (MLLMs) have led to strong reasoning ability across a wide range of tasks. However, their ability to perform mathematical reasoning from spoken input remains underexplored. Prior studies on speech modality have mostly focused on factual speech understanding or simple audio reasoning tasks, providing limited insight into logical step-by-step reasoning, such as that required for mathematical problem solving. To address this gap, we introduce Spoken Math Question Answering (Spoken-MQA), a new benchmark designed to evaluate the mathematical reasoning capabilities of speech-based models, including both cascade models (ASR + LLMs) and end-to-end speech LLMs. Spoken-MQA covers a diverse set of math problems, including pure arithmetic, single-step and multi-step contextual reasoning, and knowledge-oriented reasoning problems,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Intelligent Tutoring Systems and Adaptive Learning
MethodsSparse Evolutionary Training
