Inside-Out: Measuring Generalization in Vision Transformers Through Inner Workings

Yunxiang Peng; Mengmeng Ma; Ziyu Yao; Xi Peng

arXiv:2604.08192·cs.LG·April 10, 2026

TL;DR

This paper introduces two circuit-based metrics for evaluating vision transformer generalization. Both outperform existing proxies at predicting performance under distribution shift, before and after deployment.

Contribution

The paper proposes using a model's internal circuits as reliable, label-free proxies for generalization performance in vision transformers.

Findings

01. Dependency Depth Bias correlates with model generalization on target data.

02. Circuit Shift Score predicts model performance under distribution shifts.

03. The two metrics outperform existing proxies by over 13% and 34%, respectively.

Abstract

Reliable generalization metrics are fundamental to the evaluation of machine learning models. Especially in high-stakes applications where labeled target data are scarce, evaluation of models' generalization performance under distribution shift is a pressing need. We focus on two practical scenarios: (1) Before deployment, how to select the best model for unlabeled target data? (2) After deployment, how to monitor model performance under distribution shift? The central need in both cases is a reliable and label-free proxy metric. Yet existing proxy metrics, such as model confidence or accuracy-on-the-line, are often unreliable as they only assess model outputs while ignoring the internal mechanisms that produce them. We address this limitation by introducing a new perspective: using the inner workings of a model, i.e., circuits, as a predictive metric of generalization performance.…
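To make scenario (1) concrete, here is a minimal sketch of how a label-free proxy might be used for model selection, and how its reliability could be checked against ground truth. It does not implement the paper's circuit-based metrics, whose definitions are not given in this listing. Instead it uses mean maximum softmax probability, one of the output-only "model confidence" baselines the abstract mentions. All function names, model names, logits, and accuracies below are hypothetical and chosen only for illustration.

```python
import math

def mean_max_softmax(logits_batch):
    """Average maximum softmax probability over a batch of logit vectors.

    This is an output-only confidence proxy, NOT the paper's metric.
    """
    total = 0.0
    for logits in logits_batch:
        m = max(logits)  # subtract the max for numerical stability
        exps = [math.exp(z - m) for z in logits]
        total += max(exps) / sum(exps)
    return total / len(logits_batch)

def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def spearman(xs, ys):
    """Spearman rank correlation; valid only when neither list has ties."""
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(ranks(xs), ranks(ys)))
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))

# Hypothetical logits from three candidate models on unlabeled target data.
unlabeled_logits = {
    "model_a": [[2.0, 0.5, 0.1], [1.5, 1.4, 0.2]],
    "model_b": [[4.0, 0.1, 0.0], [3.5, 0.2, 0.1]],
    "model_c": [[0.6, 0.5, 0.4], [0.9, 0.8, 0.1]],
}

# Step 1: score every model with the label-free proxy and pick the top one.
proxy = {name: mean_max_softmax(batch) for name, batch in unlabeled_logits.items()}
best = max(proxy, key=proxy.get)

# Step 2: if target labels later become available, check how well the proxy
# ranked the models. These accuracies are made up for the example.
true_accuracy = {"model_a": 0.71, "model_b": 0.64, "model_c": 0.52}
names = sorted(proxy)
rho = spearman([proxy[n] for n in names], [true_accuracy[n] for n in names])
```

In this toy setup the confidence proxy selects `model_b`, while the made-up accuracies favor `model_a`, so the rank correlation is imperfect. That is the kind of mismatch between an output-only proxy and true target performance that motivates the paper's search for a more reliable label-free metric.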

Code & Models

Repositories

deep-real/GenCircuit (GitHub)
