Beyond Object-Level Alignment: Do Brains and DNNs Preserve the Same Transformations?
Yukiyasu Kamitani

TL;DR
This paper introduces a new method based on category theory to assess whether brains and deep neural networks preserve the same transformations among stimuli, moving beyond traditional stimulus-level alignment.
Contribution
It formalizes the concept of approximate naturality and proposes the Naturality Violation Score (NVS) to evaluate structural preservation in brain-DNN alignment.
Findings
NVS separates different types of alignment failures effectively.
Semantic axes align more strongly with deeper DNN layers and higher visual cortex.
Alignment is selective over candidate transformation families rather than uniform.
Abstract
Brain-DNN alignment is usually assessed through stimulus-level correspondence or stimulus-set geometry. Inspired by category theory, we operationalize a different question: do brain and model preserve the same candidate transformations among stimuli? We formalize this as approximate naturality: if a proxy-defined stimulus change is propagated through the brain side and then translated to the model side, the result should match translating first and then propagating, so that the naturality square approximately commutes. We quantify deviations from commutativity by a Naturality Violation Score (NVS) normalized to a permutation null, shifting alignment from per-stimulus sameness to preservation of structure under an explicitly chosen comparison map. As a proof of concept, a controlled five-factor synthetic setting shows that NVS separates complementary alignment failures that aggregate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
