Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models
Yusen Zhang, Sarkar Snigdha Sarathi Das, Rui Zhang

TL;DR
This paper identifies and analyzes Verbosity Compensation in large language models, revealing its prevalence, impact on performance, and connection to uncertainty, and proposes a simple mitigation method to reduce verbose responses.
Contribution
First to define and analyze Verbosity Compensation in LLMs, exploring its causes, effects, and proposing an effective mitigation approach.
Findings
VC is pervasive across all models and datasets, with GPT-4 at 50.40%.
Verbose responses cause a 27.61% performance gap, unaffected by model capability.
Verbose responses are linked to higher uncertainty, highlighting the need for mitigation.
Abstract
Although Large Language Models (LLMs) have demonstrated their strong capabilities in various tasks, recent work has revealed LLMs also exhibit undesirable behaviors, such as hallucination and toxicity, limiting their reliability and broader adoption. In this paper, we discover an understudied type of undesirable behavior of LLMs, which we term Verbosity Compensation (VC), similar to the hesitation behavior of humans under uncertainty, where they respond with excessive words such as repeating questions, introducing ambiguity, or providing excessive enumeration. We present the first work that defines and analyzes Verbosity Compensation, explores its causes, and proposes a simple mitigating approach. Our experiments, conducted on five datasets of knowledge and reasoning-based QA tasks with 14 newly developed LLMs, reveal three conclusions. 1) We reveal a pervasive presence of VC across all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection
