SenseAI: A Human-in-the-Loop Dataset for RLHF-Aligned Financial Sentiment Reasoning

Berny Kabalisa

arXiv:2604.05135·cs.CL·April 8, 2026

SenseAI: A Human-in-the-Loop Dataset for RLHF-Aligned Financial Sentiment Reasoning

Berny Kabalisa

PDF

TL;DR

SenseAI is a comprehensive human-in-the-loop dataset for financial sentiment reasoning that captures reasoning processes, confidence, and real-world outcomes to improve LLM alignment and understanding.

Contribution

The paper introduces SenseAI, a novel dataset with reasoning chains, confidence scores, and correction signals, tailored for RLHF-aligned financial AI model training.

Findings

01

Identifies systematic patterns in model errors, including Latent Reasoning Drift.

02

Reveals confidence miscalibration and forward projection tendencies in models.

03

Supports targeted model improvement through structured HITL data.

Abstract

We introduce SenseAI, a human-in-the-loop (HITL) validated financial sentiment dataset designed to capture not only model outputs but the full reasoning process behind them. Unlike existing resources, SenseAI incorporates reasoning chains, confidence scores, human correction signals, and real-world market outcomes, providing a structure aligned with Reinforcement Learning from Human Feedback (RLHF) paradigms. The dataset consists of 1,439 labelled data points across 40 US-listed equities and 13 financial data categories, enabling direct integration into modern LLM fine-tuning pipelines. Through analysis, we identify several systematic patterns in model behavior, including a novel failure mode we term Latent Reasoning Drift, where models introduce information not grounded in the input, as well as consistent confidence miscalibration and forward projection tendencies. These findings…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.