Beyond Words: How Large Language Models Perform in Quantitative   Management Problem-Solving

Jonathan Kuzmanko

arXiv:2502.16556·cs.CL·February 25, 2025

Beyond Words: How Large Language Models Perform in Quantitative Management Problem-Solving

Jonathan Kuzmanko

PDF

Open Access

TL;DR

This study evaluates how large language models perform on complex quantitative management problems in a zero-shot setting, revealing strengths in handling multi-step tasks but limitations in accuracy and consistency across models.

Contribution

It provides a comprehensive analysis of LLM capabilities in quantitative decision tasks, highlighting factors affecting performance and comparing multiple models in diverse scenarios.

Findings

01

28.8% of responses were exactly correct

02

Scenario complexity significantly degraded accuracy

03

Performance was stable across repeated queries

Abstract

This study examines how Large Language Models (LLMs) perform when tackling quantitative management decision problems in a zero-shot setting. Drawing on 900 responses generated by five leading models across 20 diverse managerial scenarios, our analysis explores whether these base models can deliver accurate numerical decisions under varying presentation formats, scenario complexities, and repeated attempts. Contrary to prior findings, we observed no significant effects of text presentation format (direct, narrative, or tabular) or text length on accuracy. However, scenario complexity -- particularly in terms of constraints and irrelevant parameters -- strongly influenced performance, often degrading accuracy. Surprisingly, the models handled tasks requiring multiple solution steps more effectively than expected. Notably, only 28.8\% of responses were exactly correct, highlighting…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Decision Making · Big Data and Business Intelligence

MethodsBalanced Selection