When Agents Look the Same: Quantifying Distillation-Induced Similarity in Tool-Use Behaviors

Chenghao Yang; Yuning Zhang; Zhoufutu Wen; Tao Gong; Jiaheng Liu; Qi Chu; Nenghai Yu

arXiv:2604.21255·cs.CL·April 24, 2026

When Agents Look the Same: Quantifying Distillation-Induced Similarity in Tool-Use Behaviors

Chenghao Yang, Yuning Zhang, Zhoufutu Wen, Tao Gong, Jiaheng Liu, Qi Chu, Nenghai Yu

PDF

1 Repo

TL;DR

This paper introduces two metrics, Response Pattern Similarity and Action Graph Similarity, to quantify behavioral homogenization in distilled language models, distinguishing mandatory task behaviors from autonomous preferences.

Contribution

It proposes novel metrics to measure non-mandatory behavioral patterns and demonstrates their effectiveness in differentiating model convergence and diversity.

Findings

01

Within-family model pairs have higher Action Graph Similarity than cross-family pairs.

02

Kimi-K2 (thinking) exceeds previous models in node and dependency similarity.

03

AGS effectively distinguishes teacher-specific convergence from general model improvements.

Abstract

Model distillation is a primary driver behind the rapid progress of LLM agents, yet it often leads to behavioral homogenization. Many emerging agents share nearly identical reasoning steps and failure modes, suggesting they may be distilled echoes of a few dominant teachers. Existing metrics, however, fail to distinguish mandatory behaviors required for task success from non-mandatory patterns that reflect a model's autonomous preferences. We propose two complementary metrics to isolate non-mandatory behavioral patterns: \textbf{Response Pattern Similarity (RPS)} for verbal alignment and \textbf{Action Graph Similarity (AGS)} for tool-use habits modeled as directed graphs. Evaluating 18 models from 8 providers on $τ$ -Bench and $τ^{2}$ -Bench against Claude Sonnet 4.5 (thinking), we find that within-family model pairs score 5.9 pp higher in AGS than cross-family pairs, and that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Syuchin/AgentEcho
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.