VibeCheck: Discover and Quantify Qualitative Differences in Large   Language Models

Lisa Dunlap; Krishna Mandal; Trevor Darrell; Jacob Steinhardt; Joseph; E Gonzalez

arXiv:2410.12851·cs.CL·April 22, 2025

VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models

Lisa Dunlap, Krishna Mandal, Trevor Darrell, Jacob Steinhardt, Joseph, E Gonzalez

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

VibeCheck is a system that automatically discovers and quantifies subtle, qualitative differences in large language models' outputs, such as tone and style, to better understand their unique characteristics.

Contribution

It introduces a novel iterative method to identify and measure model 'vibes' using human-aligned metrics and applies it to various models and tasks, revealing insightful behavioral differences.

Findings

01

VibeCheck accurately predicts model identity with 80% accuracy.

02

It predicts human preferences with 61% accuracy.

03

The system uncovers distinct vibes like friendliness, overexplaining, and emotional focus across models.

Abstract

Large language models (LLMs) often exhibit subtle yet distinctive characteristics in their outputs that users intuitively recognize, but struggle to quantify. These "vibes" -- such as tone, formatting, or writing style -- influence user preferences, yet traditional evaluations focus primarily on the singular axis of correctness. We introduce VibeCheck, a system for automatically comparing a pair of LLMs by discovering identifying traits of a model (vibes) that are well-defined, differentiating, and user-aligned. VibeCheck iteratively discovers vibes from model outputs and then utilizes a panel of LLM judges to quantitatively measure the utility of each vibe. We validate that the vibes generated by VibeCheck align with those found in human discovery and run VibeCheck on pairwise preference data from real-world user conversations with Llama-3-70b vs GPT-4. VibeCheck reveals that Llama has…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lisadunlap/vibecheck
pytorchOfficial

Datasets

lmarena-ai/Llama-3-70b-battles
dataset· 31 dl
31 dl

Videos

VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models· slideslive

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling

MethodsDense Connections · Layer Normalization · Residual Connection · Position-Wise Feed-Forward Layer · Attention Is All You Need · Adam · Linear Layer · Softmax · Multi-Head Attention · Dropout