Modulating Cross-Modal Convergence with Single-Stimulus, Intra-Modal Dispersion
Eghbal A. Hosseini, Brian Cheung, Evelina Fedorenko, Alex H. Williams

TL;DR
This paper introduces a method to measure how individual stimuli influence the convergence of neural network representations across modalities, revealing that stimuli with consistent intra-modal responses enhance cross-modal alignment.
Contribution
It presents a novel single-stimulus measurement technique using the Generalized Procrustes Algorithm to analyze intra-modal and cross-modal convergence in neural networks.
Findings
Low intra-modal dispersion stimuli increase cross-modal alignment by up to two times.
The effect of stimulus dispersion on convergence is robust across different model pairings.
Measuring convergence at the single-stimulus level helps understand sources of neural alignment and divergence.
Abstract
Neural networks exhibit a remarkable degree of representational convergence across diverse architectures, training objectives, and even data modalities. This convergence is predictive of alignment with brain representation. A recent hypothesis suggests this arises from learning the underlying structure in the environment in similar ways. However, it is unclear how individual stimuli elicit convergent representations across networks. An image can be perceived in multiple ways and expressed differently using words. Here, we introduce a methodology based on the Generalized Procrustes Algorithm to measure intra-modal representational convergence at the single-stimulus level. We applied this to vision models with distinct training objectives, selecting stimuli based on their degree of alignment (intra-modal dispersion). Crucially, we found that this intra-modal dispersion strongly modulates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
