Comparing human and LLM politeness strategies in free production
Haoran Zhao, Robert D.Hawkins

TL;DR
This paper compares how humans and large language models (LLMs) use politeness strategies in language, revealing that larger models mimic human preferences but tend to overuse negative politeness, affecting pragmatic alignment.
Contribution
It provides a detailed comparison of human and LLM politeness strategies, highlighting the models' strengths and limitations in pragmatic language use.
Findings
Larger models replicate key politeness preferences
Humans prefer LLM responses in open-ended tasks
Models overuse negative politeness strategies
Abstract
Polite speech poses a fundamental alignment challenge for large language models (LLMs). Humans deploy a rich repertoire of linguistic strategies to balance informational and social goals -- from positive approaches that build rapport (compliments, expressions of interest) to negative strategies that minimize imposition (hedging, indirectness). We investigate whether LLMs employ a similarly context-sensitive repertoire by comparing human and LLM responses in both constrained and open-ended production tasks. We find that larger models (70B parameters) successfully replicate key preferences from the computational pragmatics literature, and human evaluators surprisingly prefer LLM-generated responses in open-ended contexts. However, further linguistic analyses reveal that models disproportionately rely on negative politeness strategies even in positive contexts, potentially leading to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Language and cultural evolution
