TextReasoningBench: Does Reasoning Really Improve Text Classification in Large Language Models?

Xinyu Guo; Yazhou Zhang; Jing Qin

arXiv:2603.19558·cs.CL·March 23, 2026

TextReasoningBench: Does Reasoning Really Improve Text Classification in Large Language Models?

Xinyu Guo, Yazhou Zhang, Jing Qin

PDF

Open Access

TL;DR

This paper introduces TextReasoningBench, a benchmark to evaluate the effectiveness and efficiency of various reasoning strategies in large language models for text classification, revealing limited and inconsistent benefits.

Contribution

It systematically compares seven reasoning strategies across multiple models and datasets, highlighting their limited gains and high token costs in classification tasks.

Findings

01

Reasoning strategies do not always improve classification accuracy.

02

Complex reasoning methods often increase token usage significantly without proportional gains.

03

Simple strategies like CoT provide modest improvements, especially on larger models.

Abstract

Eliciting explicit, step-by-step reasoning traces from large language models (LLMs) has emerged as a dominant paradigm for enhancing model capabilities. Although such reasoning strategies were originally designed for problems requiring explicit multi-step reasoning, they have increasingly been applied to a broad range of NLP tasks. This expansion implicitly assumes that deliberative reasoning uniformly benefits heterogeneous tasks. However, whether such reasoning mechanisms truly benefit classification tasks remains largely underexplored, especially considering their substantial token and time costs. To fill this gap, we introduce TextReasoningBench, a systematic benchmark designed to evaluate the effectiveness and efficiency of reasoning strategies for text classification with LLMs. We compare seven reasoning strategies, namely IO, CoT, SC-CoT, ToT, GoT, BoC, and long-CoT across ten…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational and Text Analysis Methods · Topic Modeling · Text and Document Classification Technologies