TL;DR
This study investigates whether high test accuracy in model-behavior alignment truly indicates genuine representational similarity, revealing limitations of flexible linear transformations and emphasizing the importance of model identifiability.
Contribution
It demonstrates that even with extensive behavioral data, flexible alignment methods may not reliably identify models that genuinely match human neural representations.
Findings
Model recovery accuracy plateaus below 80% despite large data.
Misidentification linked to shifts in representational geometry.
Flexible metrics may not ensure genuine human-aligned representations.
Abstract
Linearly transforming stimulus representations of deep neural networks yields high-performing models of behavioral and neural responses to complex stimuli. But does the test accuracy of such predictions identify genuine representational alignment? We addressed this question through a large-scale model-recovery study. Twenty diverse vision models were linearly aligned to 4.5 million behavioral judgments from the THINGS odd-one-out dataset and calibrated to reproduce human response variability. For each model in turn, we sampled synthetic responses from its probabilistic predictions, fitted all candidate models to the synthetic data, and tested whether the data-generating model would re-emerge as the best predictor of the simulated data. Model recovery accuracy improved with training-set size but plateaued below 80%, even at millions of simulated trials. Regression analyses linked…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
