HAL: Inducing Human-likeness in LLMs with Alignment
Masum Hasan, Junjie Zhao, Ehsan Hoque

TL;DR
This paper introduces HAL, a framework that aligns language models to conversational human-likeness using an interpretable, data-driven reward based on explicit conversational traits, improving perceived human-likeness without sacrificing performance.
Contribution
HAL provides a novel, transparent method for aligning LLMs to human-like conversation traits through explicit, interpretable rewards derived from contrastive dialogue data.
Findings
Models aligned with HAL are perceived as more human-like in conversation.
HAL enables inspection and diagnosis of alignment behavior.
The approach works across models of different sizes without performance loss.
Abstract
Conversational human-likeness plays a central role in human-AI interaction, yet it has remained difficult to define, measure, and optimize. As a result, improvements in human-like behavior are largely driven by scale or broad supervised training, rather than targeted alignment. We introduce Human Aligning LLMs (HAL), a framework for aligning language models to conversational human-likeness using an interpretable, data-driven reward. HAL derives explicit conversational traits from contrastive dialogue data, combines them into a compact scalar score, and uses this score as a transparent reward signal for alignment with standard preference optimization methods. Using this approach, we align models of varying sizes without affecting their overall performance. In large-scale human evaluations, models aligned with HAL are more frequently perceived as human-like in conversation. Because HAL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)
