Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of   Large Language Models

Yusen Zhang; Sarkar Snigdha Sarathi Das; Rui Zhang

arXiv:2411.07858·cs.CL·December 10, 2024·2 cites

Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models

Yusen Zhang, Sarkar Snigdha Sarathi Das, Rui Zhang

PDF

Open Access 1 Repo

TL;DR

This paper identifies and analyzes Verbosity Compensation in large language models, revealing its prevalence, impact on performance, and connection to uncertainty, and proposes a simple mitigation method to reduce verbose responses.

Contribution

First to define and analyze Verbosity Compensation in LLMs, exploring its causes, effects, and proposing an effective mitigation approach.

Findings

01

VC is pervasive across all models and datasets, with GPT-4 at 50.40%.

02

Verbose responses cause a 27.61% performance gap, unaffected by model capability.

03

Verbose responses are linked to higher uncertainty, highlighting the need for mitigation.

Abstract

Although Large Language Models (LLMs) have demonstrated their strong capabilities in various tasks, recent work has revealed LLMs also exhibit undesirable behaviors, such as hallucination and toxicity, limiting their reliability and broader adoption. In this paper, we discover an understudied type of undesirable behavior of LLMs, which we term Verbosity Compensation (VC), similar to the hesitation behavior of humans under uncertainty, where they respond with excessive words such as repeating questions, introducing ambiguity, or providing excessive enumeration. We present the first work that defines and analyzes Verbosity Compensation, explores its causes, and proposes a simple mitigating approach. Our experiments, conducted on five datasets of knowledge and reasoning-based QA tasks with 14 newly developed LLMs, reveal three conclusions. 1) We reveal a pervasive presence of VC across all…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

psunlpgroup/verbosityllm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsAttention Is All You Need · Linear Layer · Dense Connections · Label Smoothing · Absolute Position Encodings · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Multi-Head Attention · Residual Connection