CLSGen: A Dual-Head Fine-Tuning Framework for Joint Probabilistic Classification and Verbalized Explanation

WonJin Yoon; Kangyu Zhu; Ian Bulovic; Autumn Sehy; Yanjun Gao; Dmitriy Dligach; Majid Afshar; Timothy A. Miller

arXiv:2604.11801 · cs.CL · April 14, 2026

TL;DR

CLSGen is a novel fine-tuning framework for large language models that enables reliable probability estimation for binary classification while preserving the ability to generate verbalized explanations.

Contribution

CLSGen introduces a new architecture, training methodology, and data strategy that together improve probability estimates without sacrificing the model's ability to generate explanations.

Findings

01. Outperforms existing baselines in AUROC and F1-score.

02. Shows strong alignment between predictions and explanations.

03. Maintains high readability of generated justifications.
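
For readers unfamiliar with the two metrics named in the first finding, the sketch below shows how AUROC and F1 are typically computed from a binary classifier's probability outputs. It is a generic illustration, not the paper's evaluation code: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for this example.

```python
def auroc(labels, scores):
    """AUROC as the probability that a random positive example is scored
    above a random negative one (ties count as half a win)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def f1_score(labels, scores, threshold=0.5):
    """F1 of the hard predictions obtained by thresholding the scores.
    The 0.5 threshold is an assumption, not a value taken from the paper."""
    preds = [1 if s >= threshold else 0 for s in scores]
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Hypothetical toy data: gold labels and predicted P(label = 1).
toy_labels = [1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.1]

toy_auroc = auroc(toy_labels, toy_scores)
toy_f1 = f1_score(toy_labels, toy_scores)
print(f"AUROC={toy_auroc:.2f}  F1={toy_f1:.2f}")
```

AUROC depends only on how the scores rank the examples, so it requires a continuous score per example rather than a hard label, whereas F1 depends on the chosen threshold. This is one reason a model that can emit a probability, not just a verbalized answer, is easier to evaluate on both.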

Abstract

With the recent progress of Large Language Models (LLMs), there is a growing interest in applying these models to solve complex and challenging problems. Modern LLMs, capable of processing long contexts and generating verbalized explanations, offer significant potential in addressing real-world applications. However, a critical hurdle in deploying LLMs for practical decision-making is their inability to provide reliable, quantitative probabilities. While task-specific fine-tuning of LLMs using traditional discriminative objectives (similar to encoder-only models) can yield probability estimates, this often leads to catastrophic forgetting and linguistic collapse. Consequently, the model loses its ability to generate explanations, severely undermining its interpretability and usability. To address this challenge, we propose CLSGen, a novel LLM fine-tuning framework designed for binary…
