DxHF: Providing High-Quality Human Feedback for LLM Alignment via Interactive Decomposition

Danqing Shi; Furui Cheng; Tino Weinkauf; Antti Oulasvirta; Mennatallah El-Assady

arXiv:2507.18802·cs.HC·July 28, 2025

DxHF: Providing High-Quality Human Feedback for LLM Alignment via Interactive Decomposition

Danqing Shi, Furui Cheng, Tino Weinkauf, Antti Oulasvirta, Mennatallah El-Assady

PDF

TL;DR

This paper introduces DxHF, an interactive interface that decomposes long texts into claims to improve human feedback quality for LLM alignment, showing increased accuracy especially under uncertainty.

Contribution

It proposes a novel decomposition-based interface, DxHF, enhancing feedback accuracy and efficiency in LLM alignment tasks compared to traditional methods.

Findings

01

Decomposition improves feedback accuracy, especially with uncertain users.

02

DxHF increases feedback accuracy by 5% on average.

03

Using DxHF slightly increases feedback time by 18 seconds.

Abstract

Human preferences are widely used to align large language models (LLMs) through methods such as reinforcement learning from human feedback (RLHF). However, the current user interfaces require annotators to compare text paragraphs, which is cognitively challenging when the texts are long or unfamiliar. This paper contributes by studying the decomposition principle as an approach to improving the quality of human feedback for LLM alignment. This approach breaks down the text into individual claims instead of directly comparing two long-form text responses. Based on the principle, we build a novel user interface DxHF. It enhances the comparison process by showing decomposed claims, visually encoding the relevance of claims to the conversation and linking similar claims. This allows users to skim through key information and identify differences for better and quicker judgment. Our technical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.