# A data-driven framework linking the connectome to spatial gene expression gradients inspired by chemoaffinity theory

**Authors:** Jigen Koike, Ken Nakae, Riichiro Hira, Yuichiro Yada, Honda Naoki

PMC · DOI: 10.1073/pnas.2516572123 · 2026-03-03

## TL;DR

This paper introduces SPERRFY, a framework that uses gene expression and brain connectivity data to explore how molecular gradients guide brain wiring, extending Sperry's chemoaffinity theory to the whole brain.

## Contribution

SPERRFY is a novel data-driven framework that applies chemoaffinity theory to whole-brain connectomics using spatial gene expression data.

## Key findings

- SPERRFY identifies gradient patterns that capture major aspects of mouse brain connectome wiring.
- Gradient-based connectivity reconstruction shows strong predictive performance and biological relevance.
- The framework screens candidate genes potentially involved in positional wiring of neural circuits.

## Abstract

Understanding how gene expression patterns determine the wiring of the brain is a fundamental question in neuroscience. SPERRFY offers a data-driven framework to decode spatial encoding of whole-brain connectivity by applying multivariate analysis to connectomic data and spatial gene expression patterns, grounded in Sperry’s chemoaffinity theory. Using mouse brain data, we identified specific gradient patterns that capture major aspects of the connectome’s wiring pattern. These findings place connectome-scale wiring patterns in line with key principles of the chemoaffinity theory, and also provide a versatile tool for mapping genetic determinants of brain-wide neural circuits. By integrating transcriptomics with connectomics, our approach potentially opens avenues for exploring the design principles underlying brain architecture.

Understanding how brain-wide neural circuits are genetically wired remains a fundamental question in neuroscience. While Sperry’s chemoaffinity theory [Sperry, Proc. Natl. Acad. Sci. U.S.A.
50, 703–710 (1963)] posits that molecular gradients provide positional cues for axonal projections, its application has been largely limited to localized sensory systems. Here, we present SPERRFY (Spatial Positional Encoding for Reconstructing Rules of axonal Fiber connectivitY), a data-driven framework that operationalizes Sperry’s theory at the whole-brain scale. By integrating connectomic data with spatial transcriptomic profiles from the Allen Mouse Brain Atlas, SPERRFY infers latent positional gradients that underlie axonal wiring. Using canonical correlation analysis (CCA), we extract top gradient pairs that align with observed neural connectivity patterns, capturing both global (interregional) and local (intraregional) organizational principles. Connectivity reconstruction based on these gradients shows strong predictive performance, and permutation-based null models confirm the biological relevance of the inferred structures. Furthermore, SPERRFY can screen for candidate genes that may contribute to positional wiring information, providing molecular insight into the developmental logic of brain-wide circuitry. Our results extend Sperry’s foundational theory beyond the sensory domain, offering a unified, data-driven framework for understanding genetically encoded connectivity across the entire brain.

## Linked entities

- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** Grin2b (glutamate receptor, ionotropic, NMDA2B (epsilon 2)) [NCBI Gene 14812] {aka GluN2B, GluRepsilon2, NR2B, Nmdar2b}, Robo2 (roundabout guidance receptor 2) [NCBI Gene 268902] {aka 2600013A04Rik, 9430089E08Rik, D230004I22Rik, mKIAA1568}, Grm5 (glutamate receptor, metabotropic 5) [NCBI Gene 108071] {aka 6430542K11Rik, Glu5R, Gprc1e, mGluR5, mGluR5b}, Pcdh19 (protocadherin 19) [NCBI Gene 279653] {aka B530002L05Rik, Gm717, mKIAA1313}, Gpbar1 (G protein-coupled bile acid receptor 1) [NCBI Gene 227289] {aka BG37, GPCR, GPR131, M-BAR, TGR5}, Nrg1 (neuregulin 1) [NCBI Gene 211323] {aka 6030402G23Rik, ARIA, D230005F13Rik, GGF, GGFII, HRG}, Efnb2 (ephrin B2) [NCBI Gene 13642] {aka ELF-2, Epl5, Eplg5, Htk-L, LERK-5, Lerk5}, Shh (sonic hedgehog) [NCBI Gene 20423] {aka 9530036O11Rik, Dsh, HHG-1, Hhg1, Hx, Hxl3}, Cpne7 (copine VII) [NCBI Gene 102278], Boc (BOC cell adhesion associated, oncogene regulated) [NCBI Gene 117606] {aka 4732455C11, mFLJ00376}, Epha1 (Eph receptor A1) [NCBI Gene 13835] {aka 5730453L17Rik, Eph, Esk}, Ephb6 (Eph receptor B6) [NCBI Gene 13848] {aka Cekl, Mep}, Nr3c2 (nuclear receptor subfamily 3, group C, member 2) [NCBI Gene 110784] {aka MR, Mlr}, Adgrg1 (adhesion G protein-coupled receptor G1) [NCBI Gene 14766] {aka Cyt28, Gpr56, TM7LN4, TM7XN1}, Dlg4 (discs large MAGUK scaffold protein 4) [NCBI Gene 13385] {aka Dlgh4, PSD-95, PSD95, SAP90, SAP90A}, Camk2a (calcium/calmodulin-dependent protein kinase II alpha) [NCBI Gene 12322] {aka CaMKII, mKIAA0968}
- **Chemicals:** PI (-), PNAS (MESH:D020135)
- **Species:** Homo sapiens (human, species) [taxon 9606], Drosophila melanogaster (fruit fly, species) [taxon 7227], Callitrichinae sp. (species) [taxon 38020], Mus musculus (house mouse, species) [taxon 10090]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12974521/full.md

---
Source: https://tomesphere.com/paper/PMC12974521