Two-Stage Reasoning-Infused Learning: Improving Classification with LLM-Generated Reasoning
Mads Henrichsen, Rasmus Krebs

TL;DR
This paper proposes a two-stage learning approach that uses LLM-generated reasoning to improve text classification accuracy, robustness, and interpretability by explicitly incorporating reasoning steps into the training process.
Contribution
It introduces a novel method leveraging LLM-generated reasoning to enhance classification models, demonstrating significant accuracy improvements on emotion classification tasks.
Findings
8.7 percentage point accuracy improvement
Effective use of reasoning to boost model performance
Enhanced interpretability through explicit reasoning
Abstract
Standard classification models often map inputs directly to labels without explicit reasoning, potentially limiting their performance, robustness, and interpretability. This paper introduces a novel two-stage approach to enhance text classification by leveraging Large Language Model (LLM)-generated reasonings. In the first stage, we fine-tune a Llama-3.2-1B-Instruct model (henceforth Llama-R-Gen) on a general-purpose reasoning dataset (syvai/reasoning-gen) to generate textual reasoning (R) given a question and its answer. In the second stage, this generally trained Llama-R-Gen is used offline to create an augmented training dataset for a downstream generative model. This downstream model, based on Llama-3.2-1B-Instruct, takes only the input text (Q) and is trained to output the generated reasoning (R) immediately followed by the predicted emotion (A). We demonstrate this methodology on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Explainable Artificial Intelligence (XAI)
