TextReasoningBench: Does Reasoning Really Improve Text Classification in Large Language Models?
Xinyu Guo, Yazhou Zhang, Jing Qin

TL;DR
This paper introduces TextReasoningBench, a benchmark to evaluate the effectiveness and efficiency of various reasoning strategies in large language models for text classification, revealing limited and inconsistent benefits.
Contribution
It systematically compares seven reasoning strategies across multiple models and datasets, highlighting their limited gains and high token costs in classification tasks.
Findings
Reasoning strategies do not always improve classification accuracy.
Complex reasoning methods often increase token usage significantly without proportional gains.
Simple strategies like CoT provide modest improvements, especially on larger models.
Abstract
Eliciting explicit, step-by-step reasoning traces from large language models (LLMs) has emerged as a dominant paradigm for enhancing model capabilities. Although such reasoning strategies were originally designed for problems requiring explicit multi-step reasoning, they have increasingly been applied to a broad range of NLP tasks. This expansion implicitly assumes that deliberative reasoning uniformly benefits heterogeneous tasks. However, whether such reasoning mechanisms truly benefit classification tasks remains largely underexplored, especially considering their substantial token and time costs. To fill this gap, we introduce TextReasoningBench, a systematic benchmark designed to evaluate the effectiveness and efficiency of reasoning strategies for text classification with LLMs. We compare seven reasoning strategies, namely IO, CoT, SC-CoT, ToT, GoT, BoC, and long-CoT across ten…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Topic Modeling · Text and Document Classification Technologies
