Good Neighbors, Bad Neighbors: The Frequent Network Neighborhood Mapping of the Hippocampus Enlightens Several Structural Factors of the Human Intelligence on a 414-Subject Cohort
Mate Fellner, Balint Varga, Vince Grolmusz

TL;DR
This study maps the frequent network neighborhoods of the hippocampus in 414 human brains, revealing structural factors that correlate with intelligence and memory performance, using advanced graph analysis techniques.
Contribution
It introduces the Frequent Network Neighborhood mapping method to identify hippocampal neighbor sets linked to psychological traits in a large cohort.
Findings
Identified hippocampal neighbor sets associated with high intelligence scores.
Found neighbor sets correlated with low memory test scores.
Demonstrated the utility of advanced graph analysis in connectomics research.
Abstract
The human connectome has become the very frequent subject of study of brain-scientists, psychologists, and imaging experts in the last decade. With diffusion magnetic resonance imaging techniques, unified with advanced data processing algorithms, today we are able to compute braingraphs with several hundred, anatomically identified nodes and thousands of edges, corresponding to the anatomical connections of the brain. The analysis of these graphs without refined mathematical tools is hopeless. These tools need to address the high error rate of the MRI processing workflow, and need to find structural causes or at least correlations of psychological properties and cerebral connections. Until now, structural connectomics was only rarely able identifying such causes or correlations. In the present work, we study the frequent neighbor sets of the most deeply investigated brain area, the…
| PMAT24 | 1 | 2 | 3 | 4 | significant | sign. for whom | |
|---|---|---|---|---|---|---|---|
| hippocampus left | high | 39 | 665 | 6646 | 42854 | 2331 | 2328 |
| hippocampus left | low | 41 | 631 | 5164 | 25824 | 3 | |
| hippocampus right | high | 50 | 873 | 8142 | 48521 | 1788 | 1757 |
| hippocampus right | low | 49 | 817 | 7059 | 37558 | 31 | |
| hippocampus | high | 62 | 1325 | 15297 | 113579 | 5345 | 5313 |
| hippocampus | low | 54 | 1036 | 10761 | 70252 | 32 | |
| IWRD | 1 | 2 | 3 | 4 | significant | sign. for whom | |
| hippocampus left | high | 39 | 637 | 5684 | 31139 | 963 | 0 |
| hippocampus left | low | 41 | 691 | 6675 | 41200 | 963 | |
| hippocampus right | high | 47 | 833 | 7663 | 43337 | 456 | 41 |
| hippocampus right | low | 49 | 850 | 7705 | 43918 | 415 | |
| hippocampus | high | 55 | 1082 | 11219 | 72613 | 5484 | 0 |
| hippocampus | low | 62 | 1307 | 15077 | 114860 | 5484 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Good Neighbors, Bad Neighbors: The Frequent Network Neighborhood Mapping of the Hippocampus Enlightens Several Structural Factors of the Human Intelligence on a 414-Subject Cohort
Máté Fellner
Bálint Varga
Vince Grolmusz
PIT Bioinformatics Group, Eötvös University, H-1117 Budapest, Hungary
Uratim Ltd., H-1118 Budapest, Hungary
Abstract
The human connectome has become the very frequent subject of study of brain-scientists, psychologists, and imaging experts in the last decade. With diffusion magnetic resonance imaging techniques, unified with advanced data processing algorithms, today we are able to compute braingraphs with several hundred, anatomically identified nodes and thousands of edges, corresponding to the anatomical connections of the brain. The analysis of these graphs without refined mathematical tools is hopeless. These tools need to address the high error rate of the MRI processing workflow, and need to find structural causes or at least correlations of psychological properties and cerebral connections. Until now, structural connectomics was only rarely able identifying such causes or correlations. In the present work, we study the frequent neighbor sets of the most deeply investigated brain area, the hippocampus. By applying the Frequent Network Neighborhood mapping method, we identified frequent neighbor-sets of the hippocampus, which may influence numerous psychological parameters, including intelligence-related ones. We have found neighbor sets, which have significantly higher frequency in subjects with high-scored Penn Matrix tests, and with low-scored Penn Word Memory tests. Our study utilizes the braingraphs, computed from the imaging data of the Human Connectome Project’s 414 subjects, each with 463 anatomically identified nodes.
Introduction
Our brain contains approximately 80 billion neurons, each connected to hundreds or even thousands of other neurons. All brain functions are closely connected to this network of the brain, frequently called “the connectome” [1, 2, 3]. Today, the neuronal-level connectome (or braingraph), where the nodes correspond to the 80 billion neurons, and two nodes are connected by an edge if the corresponding neurons are connected by an axon, is unknown for us. The only fully developed species with known neuronal-level braingraph is that of the nematode Caenorhabditis elegans, with 302 neurons, determined in the eighties by electron-microscopic techniques ([4], the graph can be downloaded from braingraph.org [5]). More recently, serious developments are reported in the mapping of the neuronal-level braingraph of the fruitfly Drosophila melanogaster with 100,000 neurons [6].
With currently available techniques the human braingraph can be constructed and analyzed in a much coarser resolution than the neuronal level, with the help of diffusion magnetic resonance imaging (MRI) [7]. In these graphs, the nodes are anatomically identified 1-1.5 cm2 areas of the gray matter (frequently addressed as “ROIs”, i.e., Regions Of Interests), and two nodes are connected by an edge if the diffusion MRI analyzing workflow [7, 8, 9, 10] finds axonal fiber tracts between them. Therefore, we can construct today braingraphs upto 1015 nodes and several thousand edges. Perhaps the most reliable large human MRI datasets to date are the public releases of the Human Connectome Project (HCP) [11].
The graph-theoretical analysis of the braingraph
The exact, robust and graph-theoretical analysis of the human braingraphs is a fast developing and important area today. Our research group has contributed numerous results in this field, analyzing the HCP data. We have computed hundreds of braingraphs [5], and prepared the Budapest Reference Connectome Server, which generates the graph of -frequent edges of the human connectome of n=477 people, where , and the -frequent edges are those, which are present in at least braingraphs out of the n=477. The parameter is selectable, along with other parameters at the webserver https://pitgroup.org/connectome/, and the resulting consensus graph can be visualized and downloaded from the site [12, 13].
In the work [14] we have mapped the individually more and less variable lobes of the human brain on 395 subjects, with the help of a natural measure: the distribution function. We have shown that the frontal and the limbic lobes are more conservative, while the edges in the temporal and occipital lobes show more diversity between the individual braingraphs. We have also compared the lobes of the brain by computing numerous graph-theoretical parameters in the sub-graphs, induced by the vertices of the lobes in [15]. We have found that the right temporal and the right parietal lobes have better connectedness-related graph-theoretical parameters than the left ones (e.g., larger minimum vertex cover, larger Hoffman-bound). More interestingly, the left frontal lobe has better such parameters than the right one.
We have compared the volumetric properties of the male and female brain areas in [16], and the sex differences in the human brain connectomes in [17, 18, 19]. We have shown a strong statistical advantage of the female connectomes in the connectedness-related advanced graph-theoretical parameters in a smaller cohort in [17] and in a larger cohort in [18]. In [19] we have clarified that the better, connectedness-related braingraph parameter-results of women cannot be due to the brain-volume differences: we have identified 36 large-brain females and 36 small-brain males, such that the brain volumes of all females were larger in the group than those of all males, and the advantage of the women remained valid even after this highly specific subject selection.
The development of the connections in the mammal brains is a hot research area today with many open questions. Lots of information were learned from embryonic rat and mouse brain microscopy on the development of single neuronal tracts [20, 21]. In the human brain, much less is known about the phases of the axonal development and growth. By analyzing the features of the publicly available Budapest Reference Connectome Server [http:
connectome.pitgroup.org](http:%5C%5C%0Aconnectome.pitgroup.org), we have discovered the phenomenon of the Consensus Connectome Dynamics (CCD), which, by our hypothesis, describes the individual axonal development of the human brain [22, 23, 24, 25]. The CCD phenomenon is also applicable for directing the edges of the braingraph [24, 25].
Robust methods
The robust analysis of the MR imaging data is an important point in all applications, since there are numerous complex steps, where noise or data processing artifacts may appear in the image processing workflow. For example, one such area is the tractography phase, where the crossing axonal fibers may induce errors in the processing [26, 27, 28]. Therefore, the error-correcting analytical methods have an utmost importance in processing of these data.
Our research group pioneered several such methods by examining the frequently appearing substructures. This approach will not consider rarely appearing errors, since if we deal with substructures, which appear with a minimum frequency of 80% or 90%, then the infrequent errors will be filtered out. The Budapest Reference Connectome Server generates the -frequent edges [12, 13]. In the work [29] we have mapped the frequently appearing subgraphs of the human connectome. The frequent complete subgraphs of the human braingraph were identified in [30].
Numerous publications attempt to find correlations between the psychological and anatomical, more exactly, connectomical, or graph-theoretical properties of the braingraph (e.g., [31]). The difficulty of identifying structural-psychological correlations lies in the individual diversity of the cerebral connections. One possible solution to this difficulty is the comparison of the frequent substructures with the results of psychological measurements.
In the publication [32] we defined the Frequent Network Neighborhood Mapping.
The Frequent Network Neighborhood Mapping
Here we would like to formalize the frequent neighborhood mapping. The motivation of the formalism below is the identification of the robust, frequent neighborhoods of some important node , where the word “frequent” means that the same neighborhood of appears frequently in the braingraphs of the N subjects of ours:
Let be a graph with vertex-set and edge-set . Let be a vertex. Vertex is a neighbor of if the unordered pair is an edge of . Then , called the neighbor-set of , contains all the neighbors of vertex , that is:
[TABLE]
Now, let us consider graphs on the very same vertex-set . Let , and let
[TABLE]
In other words, is the neighborhood of in graph .
We say that the vertex-set is a -frequent neighborhood of if there are at least indices , such that . If, say, , then is a frequent neighbor set of with a cut-off value (or threshold) of 80%.
In the work [32] we have identified the frequent neighbor sets of the hippocampus of size at most 4, with threshold of 80%. We have also identified the frequent neighbor-sets of the hippocampus, which were more frequent in male and female subjects, respectively.
Discussion and Results
The hippocampus is, perhaps, the most frequently and deeply investigated area of the brain: it is a part of the limbic system, it has a role in turning short-time memory into long-time memory, in spatial orientation, navigation and memory [33, 34, 35, 36]. It is a sea-horse-shaped organ, and it is present in the left- and also in the right hemispheres: that is, there is a left- and a right hippocampus in the brain.
Here we identify the frequent hippocampus neighbor sets of size up to 4, for hippocampi in both hemispheres. Next, we investigate whether the presence of these neighbors of the hippocampus has any statistical significance with some, intelligence-related test results of the subjects. We call the hippocampus neighbor-sets, with these significant differences in frequencies ”significant sets” in short.
The motivation of this study is as follows: by the best of our knowledge, no connections were proven between the presence or absence of any single connectome-edge and any psychological property of the subjects examined. This failure may be due to the great variability and plasticity of the brain connections [14, 12, 13]. Here we want to overcome these difficulties in a two-fold strategy:
- (i)
Instead of the individual appearances of graph-theoretical objects we consider frequent objects;
- (ii)
Instead of frequent single edges from vertex we consider frequent subsets of the neighbor-set , where is the hippocampus.
Measures of intelligence
In the present study, we consider two psychological tests, which were administered to the subjects of the Human Connectome Project:
PMAT24_A_CR: Penn Matrix Test: Number of Correct Responses; scored from 0 through 24. This is a multiple-choice test where the subject needs to choose the best fit from a list of objects into the one empty position of a small matrix of objects. The PMAT test is believed to assess the mental abstraction and flexibility [37]. The higher scores show better mental abilities. We grouped the scores as “low” between 0 and 16, and “high” between 17 and 24; the cut-off score 17 is the median.
IWRD_TOT: Penn Word Memory Test: Total Number of Correct Responses, scored from 0 through 40. In the first phase of the test, the subjects need to memorize 20 written words. In the recognition phase, 40 words are shown, and the participants need to decide whether the words were seen in the first phase or not. The score is the number of the correct answers. We valued the scores 0-35 as “low” and 36-40 as “high”, the cut-off score 36 is the median.
Table 1 shows the results of the Frequent Network Neighborhood Mapping for these two tests. The table list the numbers of the frequent neighbor sets of the left- and the right hippocampus in the connectomes of the subjects with high- and low PMAT24 and IWRD test scores, respectively.
In the columns, labeled by 1,2,3 and 4 the numbers of the 1,2,3 and 4-element frequent neighbor-sets are given, for the subjects with high and low test scores. The threshold for “frequent” sets is 80% in the cases of both the right- and the left hippocampus, and 90%, when their union is considered. The column with “significant” label contains the number of the neighborhood sets of the statistically differing (p=0.01) frequencies in the “low” and the “high” test scores (called briefly “significant sets”). In the case of PMAT24 tests, the majority of the significant sets are related to the high test values. This may imply that these neighborhoods of the hippocampus are beneficial for the PMAT24 test results, so, these are the “good neighbors” of the hippocampus.
Good and bad neighbors of the hippocampus for the Penn Matrix test
Here we list some neighbor sets with significant differences of frequencies in low- and high-scored PMAT24 subjects. The full lists can be downloaded from http://uratim.com/hc/hc_neighbors_PMAT_IWRD_xls.zip; we refer to the naming conventions of the files there to the ”Data Availability” section below.
The the naming of the nodes below follows those listed in the CMTK nypipe GitHub repository https://github.com/LTS5/cmp_nipype/blob/master/cmtklib/data/parcellation/lausanne2008/ParcellationLausanne2008.xls.
In this test, most of the significant sets are related to the higher scores.
The following three neighbor sets of the left hippocampus have a significantly higher frequency in low-scored PMAT24 subjects:
(lh.bankssts_3)(lh.fusiform_5)(lh.inferiorparietal_4)(lh.insula_1)
(lh.bankssts_3)(lh.insula_1)(lh.lateraloccipital_9)(lh.lingual_8)
(lh.bankssts_3)(lh.fusiform_5)(lh.inferiorparietal_4)(lh.lingual_8)
Two examples from the 2328 significant sets of the neighbors of the left hippocampus, having higher frequencies in the better-scored subjects:
(Left-Caudate)(lh.fusiform_7)(lh.inferiorparietal_5)(lh.isthmuscingulate_2)
(Left-Accumbens-area)(Left-Pallidum)(lh.inferiorparietal_5)(lh.isthmuscingulate_2)
Two examples from the 32 sets neighboring the united left- and right hippocampi, with higher frequencies in lower PMAT24 scored subjects:
(lh.bankssts_3)(lh.fusiform_5)(lh.isthmuscingulate_3)(rh.precuneus_3)
(Right-Thalamus-Proper)(lh.bankssts_3)(lh.fusiform_5)(lh.insula_1)
Good and bad neighbors of the hippocampus for the Penn Word Memory test
In this test, most of the significant sets are related to the lower scores.
Two examples from the 41 neighbor-sets of the right hippocampus with significantly higher frequency in higher-scored Penn Word Memory test subjects.
(Right-Accumbens-area)(Right-Pallidum)(rh.bankssts_3)
(Right-Accumbens-area)(Right-Thalamus-Proper)(rh.bankssts_3)(rh.isthmuscingulate_2)
Two examples from the 963 neighbor-sets of the left hippocampus with significantly higher frequency in lower-scored Penn Word Memory test subjects.
(lh.inferiorparietal_4)(lh.insula_2)(lh.precuneus_11)(lh.supramarginal_1)
(Left-Thalamus-Proper)(lh.inferiorparietal_4)(lh.inferiorparietal_5)(lh.supramarginal_1)
Two examples from the 5484 neighbor-sets of the hippocampus with significantly higher frequency in lower-scored Penn Word Memory test subjects.
(lh.inferiorparietal_4)(lh.inferiorparietal_5)(rh.insula_4)(rh.superiortemporal_1)
(Right-Thalamus-Proper)(lh.inferiorparietal_5)(lh.precuneus_11)(rh.superiortemporal_1)
Materials and Methods
The braingraphs in our work were computed from the MRI data of the Human Connectome Project’s Public Data Release at http://www.humanconnectome.org/documentation/S500 [11]. The subjects of this study were healthy young adults, between the ages of 22 and 35 years. The braingraphs were computed by us, applying the CMTK toolkit [10], with randomized seeding, 1 million streamlines and deterministic tractography. We have used the 463-vertex graphs for the present work. The graphs are available freely for download at our site: https://braingraph.org/cms/download-pit-group-connectomes/. The workflow, by which the graphs were computed from the HCP data is described in details in [5].
The computation of the frequent neighbor sets of the hippocampus, which facilitated the Frequent Network Neighborhood Mapping, used an apriori-like algorithm [38, 39], with small modifications: http://adataanalyst.com/machine-learning/apriori-algorithm-python-3-0/, similarly as in [32]. Succinctly, the apriori algorithm makes use of the following observation: If vertex-set is frequent with a cut-off value, say 80%, then all of the subsets of has a frequency at least of the cut-off value, i.e., 80%. Therefore, the 2-element frequent sets can be built from the 1-element frequent sets, the 3-element frequent sets from the already identified 2-element frequent sets, and so on.
For the identification of the frequent neighbor-sets, we have applied a two-step strategy: First, we partitioned the braingraphs into two groups by the parity of the second digit of the ID of the subjects. Next, we identified the frequent neighbor sets of the hippocampus within both classes of the partition (using the apriori algorithm), with a cut-off value of 80%. Only those sets were accepted to be frequent, which were frequent with cut-off value 80% in both classes. In a certain sense, this strategy modeled the frequency counting in random subsets; consequently, those neighbor sets, which were frequent only in one of the classes, were identified as such.
Statistical Analysis
Next, we analyzed the appearance of the frequent hippocampus neighbor sets in the high- and low-scored PMAT24_A_CR and IWRD_TOT subjects. We identified the neighbor-sets, which were significantly more frequent in the connectomes of the high-scored and the low-scored subjects. For the statistical analysis we used test with significance bound of , with Holm-Bonferroni corrections [40].
Our null hypothesis is that the frequencies are the same in the connectomes of the low- and high scored subjects, and we refute this hypothesis with p=0.01.
For a neighbor set , its occurrences were counted in the low-scored dataset by and in the high-scored dataset by . The support was computed as follows: , where , for , is the number of the subjects with low- and high scores, respectively.
For the significance analysis in the difference of and we used the -test for categorical data:
[TABLE]
Now:
[TABLE]
The degree of freedom for this test is one (since it is the number of samples minus one times the number of categories minus one).
Holm-Bonferroni correction [40]: The p-values for the frequent sets were ordered . For a significance level , let the Holm-Bonferroni value for frequent set be defined as . Then let be the minimum index such that : The null hypotheses for indices need to be rejected.
Conclusions
By the application of Frequent Network Neighborhood Mapping, we examined the neighbors of the human hippocampus, and found that some frequent neighbor sets correlate with the better Penn Matrix test results, and some frequent neighbor sets correlate with worse Penn Word Memory test results. By our knowledge, this is the first result which connects the intelligence-related measures with the neighbors of the human hippocampus, with strict statistical significance analysis.
Data availability
The data source of this study is Human Connectome Project’s Public Data Release at http://www.humanconnectome.org/documentation/S500 [11].
The parcellation data, containing the ROI labels, is listed in the CMTK nypipe GitHub repository https://github.com/LTS5/cmp_nipype/blob/master/cmtklib/data/parcellation/lausanne2008/ParcellationLausanne2008.xls.
The braingraphs can be downloaded from the https://braingraph.org/cms/download-pit-group-connectomes/ site, by choosing the “Full set, 413 brains, 1 million streamlines” option. In the present study, we have used exclusively the 463-node resolution graphs.
The result tables, with the listing of the frequent neighbor sets, whose frequency differ significantly in low- and high scored subjects, can be downloaded in Excel format from the site http://uratim.com/hc/hc_neighbors_PMAT_IWRD_xls.zip. The archive contains 12 files. Six of them has the prefix IWRD_TOT, six of them PMAT24_A_CR, containing the significant sets for these tests. After the prefix, the filenames carry strings hc_l or hc_r or hc, meaning that the neighbor-sets are those of the left- or right hippocampus, or the union of those. Next, the word ”lower” or ”upper” mean that the neighbor sets have significant differences in the frequency in the lower half or the upper half of the scored subjects. Those tables, which correspond to 0s in Table 1 are empty.
Acknowledgments
Data were provided in part by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University. VG and BV were partially supported by the VEKOP-2.3.2-16-2017-00014 program, supported by the European Union and the State of Hungary, co-financed by the European Regional Development Fund, VG and MF by the NKFI-126472 and NKFI-127909 grants of the National Research, Development and Innovation Office of Hungary. BV and MF was supported in part by the EFOP-3.6.3-VEKOP-16-2017-00002 grant, supported by the European Union, co-financed by the European Social Fund.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Seung [2009] H Sebastian Seung. Reading the book of memory: sparse sampling versus dense mapping of connectomes. Neuron , 62(1):17–29, Apr 2009. doi: 10.1016/j.neuron.2009.03.020 . URL http://dx.doi.org/10.1016/j.neuron.2009.03.020 . · doi ↗
- 2Sporns et al. [2005] Olaf Sporns, Giulio Tononi, and Rolf Kötter. The human connectome: A structural description of the human brain. P Lo S Computational Biology , 1(4):e 42, Sep 2005. doi: 10.1371/journal.pcbi.0010042 . URL http://dx.doi.org/10.1371/journal.pcbi.0010042 . · doi ↗
- 3Lichtman et al. [2008] Jeff W. Lichtman, Jean Livet, and Joshua R. Sanes. A technicolour approach to the connectome. Nature Review Neuroscience , 9(6):417–422, Jun 2008. doi: 10.1038/nrn 2391 . URL http://dx.doi.org/10.1038/nrn 2391 . · doi ↗
- 4White et al. [1986] JG White, E Southgate, JN Thomson, and S Brenner. The structure of the nervous system of the nematode caenorhabditis elegans: the mind of a worm. Phil. Trans. R. Soc. Lond , 314:1–340, 1986.
- 5Kerepesi et al. [2017] Csaba Kerepesi, Balazs Szalkai, Balint Varga, and Vince Grolmusz. The braingraph. org database of high resolution structural connectomes and the brain graph tools. Cognitive Neurodynamics , 11(5):483–486, 2017.
- 6Zheng et al. [2018] Zhihao Zheng, J Scott Lauritzen, Eric Perlman, Camenzind G Robinson, Matthew Nichols, Daniel Milkie, Omar Torrens, John Price, Corey B Fisher, Nadiya Sharifi, Steven A Calle-Schuler, Lucia Kmecova, Iqbal J Ali, Bill Karsh, Eric T Trautman, John A Bogovic, Philipp Hanslovsky, Gregory S X E Jefferis, Michael Kazhdan, Khaled Khairy, Stephan Saalfeld, Richard D Fetter, and Davi D Bock. A complete electron microscopy volume of the brain of adult drosophila melanogaster.
- 7Hagmann et al. [2012] Patric Hagmann, Patricia E. Grant, and Damien A. Fair. MR connectomics: a conceptual framework for studying the developing brain. Front Syst Neurosci , 6:43, 2012. doi: 10.3389/fnsys.2012.00043 . URL http://dx.doi.org/10.3389/fnsys.2012.00043 . · doi ↗
- 8Fischl [2012] Bruce Fischl. Freesurfer. Neuroimage , 62(2):774–781, 2012.
