ClaHF: A Human Feedback-inspired Reinforcement Learning Framework for Improving Classification Tasks

Tianxiang Xu; Xiaoyan Zhu; Xin Lai; Jiayin Wang

arXiv:2605.17458·cs.LG·May 19, 2026

ClaHF: A Human Feedback-inspired Reinforcement Learning Framework for Improving Classification Tasks

Tianxiang Xu, Xiaoyan Zhu, Xin Lai, Jiayin Wang

PDF

1 Repo

TL;DR

ClaHF introduces a reinforcement learning framework inspired by human feedback to enhance text classification models by integrating preference modeling and reward optimization without extra human labels.

Contribution

It presents a novel method that converts label supervision into preference signals, improving classification accuracy and calibration across multiple tasks and language models.

Findings

01

Consistently improves classification performance across eight tasks.

02

Enhances confidence calibration in diverse language models.

03

Operates without requiring additional human annotations.

Abstract

Text classification models are typically trained via supervised fine-tuning (SFT). However, SFT essentially performs behavior cloning from instance-wise labels and thus fails to adequately capture relative preference relations among samples, which limits the model's ability to shape decision boundaries and calibrate predictive confidence. In this paper, we propose ClaHF, a human feedback-inspired reinforcement learning (RL) framework for text classification that integrates preference modeling and RL optimization into the classification pipeline without requiring additional human annotations. Unlike prior work that relies solely on instance-wise supervision, ClaHF constructs multiple candidate predictions together with their relative ranking relations, and jointly models the Top-1 preference and the ordering among non-optimal candidates within a reward model (RM). This design converts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://anonymous.4open.science/r/ClaHF
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.