TL;DR
This paper introduces two metrics, Response Pattern Similarity and Action Graph Similarity, to quantify behavioral homogenization in distilled language models, distinguishing mandatory task behaviors from autonomous preferences.
Contribution
It proposes novel metrics to measure non-mandatory behavioral patterns and demonstrates their effectiveness in differentiating model convergence and diversity.
Findings
Within-family model pairs have higher Action Graph Similarity than cross-family pairs.
Kimi-K2 (thinking) exceeds previous models in node and dependency similarity.
AGS effectively distinguishes teacher-specific convergence from general model improvements.
Abstract
Model distillation is a primary driver behind the rapid progress of LLM agents, yet it often leads to behavioral homogenization. Many emerging agents share nearly identical reasoning steps and failure modes, suggesting they may be distilled echoes of a few dominant teachers. Existing metrics, however, fail to distinguish mandatory behaviors required for task success from non-mandatory patterns that reflect a model's autonomous preferences. We propose two complementary metrics to isolate non-mandatory behavioral patterns: \textbf{Response Pattern Similarity (RPS)} for verbal alignment and \textbf{Action Graph Similarity (AGS)} for tool-use habits modeled as directed graphs. Evaluating 18 models from 8 providers on -Bench and -Bench against Claude Sonnet 4.5 (thinking), we find that within-family model pairs score 5.9 pp higher in AGS than cross-family pairs, and that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
