The normalizing properties of intracranial volume across race and sex
Peirong Liu, Dina Zemlyanker, Karthik Gopinath, You Cheng, Yingnan He, David Izquierdo-Garcia, Teresa Gomez-Isla, Sudeshna Das, Shahin Nasr, Kevin N Sheth, Matthew S Rosen, W Taylor Kimberly, Adam de Havenon, Francis X Shen, Juan Eugenio Iglesias

TL;DR
This study shows that adjusting brain measurements for intracranial volume can eliminate differences between sexes and races, allowing for fair comparisons in MRI-based brain research.
Contribution
The study demonstrates that intracranial volume normalization effectively accounts for sex and race differences in brain volume measurements.
Findings
Intracranial volume differences are consistent across sex and race.
Normalizing brain volumes by intracranial volume removes sex and race differences in MRI studies.
This normalization allows aggregation of diverse groups in brain volumetric analyses.
Abstract
In volumetric analysis of the human brain with MRI, intracranial volume is an important covariate as it has a strong correlation with the volume of regions of interest in the brain. Therefore, accurate adjustment for intracranial volume (e.g. by division) is essential to mitigate the impact of head size on brain measurements. In this study, we assess the effects of intracranial volume normalization on differences across sex and racial groups, using a diverse cohort with 5977 subjects from three different (self-reported) races. We show that (i) differences in intracranial volume across sex are consistent across race and vice versa; and (ii) intracranial volume normalization almost completely accounts for differences by race and sex. These results suggest that subjects of different sexes and races can be safely aggregated in volumetric studies by normalizing the volume of regions of…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3| Race/sex | Female | Male |
|---|---|---|
| Asian | 1058 (age: 48.9 ± 18.6) | 733 (age: 52.3 ± 18.6) |
| Black | 1213 (age: 52.0 ± 18.5) | 833 (age: 53.1 ± 18.1) |
| White | 1179 (age: 55.1 ± 19.0) | 961 (age: 55.4 ± 17.7) |
| Race/sex | Female | Male |
|---|---|---|
| Asian | 1.4078 ± 0.10769 | 1.5753 ± 0.12189 |
| Black | 1.3641 ± 0.10857 | 1.5296 ± 0.12255 |
| White | 1.4319 ± 0.11411 | 1.6144 ± 0.1277 |
- —National Institutes of Health (NIH)10.13039/100006084
- —NIH10.13039/100020940
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neuroimaging Techniques and Applications · Functional Brain Connectivity Studies · MRI in cancer diagnosis
Introduction
MRI is pivotal in imaging of the human brain in vivo, as it provides a non-invasive view into its complex anatomical structure, neural pathways and functional dynamics.^1^ Specifically, morphometric analysis of brain MRI scans enables volumetry of regions of interest (ROIs), which in turn enables the study of structural changes associated with normal aging and various diseases, such as Alzheimer’s disease,^2^ schizophrenia^3^ and traumatic brain injury (TBI).^4^ By measuring alterations in brain volume, researchers can gain insight into disease progression^5^, evaluate treatment efficacy^6^ and identify potential biomarkers,^7^ thus advancing our understanding and management of these disorders.
In conducting volumetric analyses, researchers consider various covariates such as age, sex and other demographic variables.^8^ Among these, intracranial volume (ICV) is particularly important due to its strong correlation with ROI volumes.^9^ Accurate adjustment for ICV is crucial, whether through direct division or as a nuisance variable in general linear models, to account for the influence of head size on brain measurements and ensure precise analysis and interpretation. Even though sex is also commonly included as a covariate in volumetry, the incorporation of ethnicity and race as explanatory variables remains limited.^10^ Reasons include: subjective and potentially inconsistent labelling and measurement of ethnoracial categories; historical misuse of race-based analyses to draw inferences about racial hierarchies; measuring racism and other social determinants of health instead; or interaction with other variables, among others.^11-13^
In this study, we highlight that normalizing for ICV by simple division effectively eliminates most volumetric variations observed across sex and race. We analysed a diverse retrospective cohort with 5977 subjects scanned at Massachusetts General Hospital (MGH) whose scans did not reveal any structural brain abnormalities. Our results show that ICV normalization corrects most of the systematic differences in brain volumes attributed to race and sex.
Materials and methods
Data acquisition
In this study, we used MRI scans from 5977 MGH subjects who were scanned at the hospital between 2014 and 2024. These scans were downloaded from the MGH picture archiving and communication system (PACS) under IRB protocol 2015P001915. The selection criteria were: age between 18 and 90 years; presence of non-surgical brain MRI scans; and absence of structural abnormalities, such as tumours, strokes, or traumatic brain injury—which is automatically determined from their clinical data repository by the MGH Research Patient Data Registry system (RPDR). These criteria yielded a cohort that was as close as possible to a control group. Race is self-reported, and is captured in the electronic health record, and automatically extracted into RPDR. We utilized data from all available Asian and Black subjects satisfying the criteria (1799 and 2051, respectively), as well as 2150 White subjects to complete a sample size of N = 5977.
We note that data was acquired from various scanners at hospitals affiliated with MGB over the past 10 years, including periods of scanner upgrades. Unfortunately, we do not have detailed or reliable information on the specific scanning platforms. Nevertheless, the SynthSeg+ method has been thoroughly validated in this scenario.^14^
Statistical analysis
Automated segmentation is performed using SynthSeg+,^14^ which provides segmentations and volumes for 31 ROIs, including the ICV. Each MRI session may consist of a varying number of 3D scans. We utilize SynthSeg+ as described in its original publication and include only those scans that achieve automatic quality control (QC) scores > 0.75. If more than three scans meet this criterion, we select the top three scans with the highest QC scores. The ROI volumes are then averaged from the scans that pass the QC check, or the top three scans when applicable. For subjects with multiple visits, we average the ROI volumes and ages from these different visits to obtain a single estimate and make the analysis independent of the number of visits. This ensures that the analysis accounts for variability in age and scanning quality across multiple sessions. We note that 23 subjects did not have any scans passing the QC criteria, which led to the final sample size of N = 5977 (Table 1). ICV correction was achieved by direct division—followed by direct multiplication of the average ICV, which preserves the mean volume of the ROIs after normalization. We note that correction by other widespread method (residual, covariate) did not affect the results (please see further details in the ‘Results’ section and the supplementary materials).
When fitting curves, we acknowledge that outliers may arise, either from missegmentations (as SynthSeg+ might occasionally fail), or from the abnormalities that may be missed by RPDR. For this reason, we employ robust statistical analysis. Specifically, we use a Laplacian distribution to model the data, where the location and scale parameters are smooth functions of age. To achieve this, we model the location and scale using B-splines with four control points equally spaced at ages 27, 45, 63 and 81. The fit is optimized with gradient ascent over the log-likelihood function:
where is the probability density function of the Laplace distribution with location μ and scale b; is the volume of the ROI for subject n; is the age of subject n; is a B-spline describing the location, parameterized by ; and is a B-spline describing the scale, parameterized by . Note that the location parameter μ represents the median that minimizes the sum of L1 errors, and the 95% confidence interval is (almost exactly) given by three times the scale, i.e. .
We compute effect sizes at any given age using the pooled (average) scales:
where a is the age, and the subindices 1 and 2 refer to the two compared groups.
Finally, we note that, for the ventricles, we modelled the logarithm of their volume rather than the volume directly, since the distribution of ventricular volumes at any given age is considerably skewed towards higher values.
Results
Table 1 summarizes the sample size and demographics of the cohort used in this study, which includes 5977 subjects from three different racial categories: Asian, Black and White. Table 2 displays ICV values categorized by self-reported race and sex. Crucially, this table shows that the differences between sexes are very consistent across races and vice versa: the male/female ICV ratio is 1.119 for Asian subjects, 1.121 for Black subjects and 1.127 for White subjects; whereas the racial ICV ratios are: 1.032, 0.953 and 0.983 for females; versus 1.029, 0.947 and 0.976 for males. We note that the global male/female ratio (1.12) is highly consistent with previous studies.^15^
Figures 1–3 present a detailed analysis of sex- and race-specific aging trajectories before and after ICV correction. These figures also include effect sizes between groups, as well as aging trajectories using pooled, ICV-corrected data. We show trajectories for the volume of the whole brain, as well as for seven ROIs of interest: hippocampus, amygdala, ventricles, brainstem, thalamus, white matter and cortex. The curves before (Column A) and after (Column B) ICV normalization illustrate the powerful normalization properties of this correction, particularly for some ROIs such as the cortex, white matter, thalamus, or whole brain, which shift from strong differences to almost perfect overlaps. This is also reflected quantitatively in the effect sizes across race and sex, which are reduced by factors between 2 and 8. The pooled figures (Column C), which include the individual data points (with colour-coded race and sex), yield aging trajectories that are generally consistent with prior studies^14^—albeit with a much wider distribution of races. In the supplementary materials, we provide experimental results on additional ICV normalization methods, including division, residual, covariate and matching—following.^16-18^
Aging curves for the whole brain. Aging curves for the whole brain and individual ROIs: comparison of sex- and race-specific trajectories before and after ICV correction, as well as trajectories from pooled, ICV-corrected data (95% confidence interval marked with dashed lines). The vertical lines represent the largest effect size (dmax) across group pairs and ages (N = 5977).
Aging curves for the ventricles, thalamus and hippocampus. Aging curves for the whole brain and individual ROIs: comparison of sex- and race-specific trajectories before and after ICV correction, as well as trajectories from pooled, ICV-corrected data (95% confidence interval marked with dashed lines). The vertical lines represent the largest effect size (dmax) across group pairs and ages (N = 5977).
Aging curves for the white matter, cerebral cortex, brainstem and amygdala. Aging curves for the whole brain and individual ROIs: comparison of sex- and race-specific trajectories before and after ICV correction, as well as trajectories from pooled, ICV-corrected data (95% confidence interval marked with dashed lines). The vertical lines represent the largest effect size (dmax) across group pairs and ages (N = 5977).
Discussion and conclusion
While most volumetric studies of brain ROIs may not thoroughly model ethnoracial factors, our results indicate that this omission may not undermine their results. The implementation of ICV correction—which is widespread in volumetric studies—seems sufficient to address most of the variations due to sex and race. By accounting for intracranial volume differences, ICV normalization effectively mitigates the confounding effects of head size, thereby enhancing the accuracy and comparability of neuroimaging studies across diverse populations. This adjustment not only improves the validity of comparative analyses but also enriches our understanding of how brain structures evolve with age across various demographic groups. It provides a more nuanced perspective on brain morphology by eliminating size-related biases, allowing researchers to focus on the actual structural differences and changes.
Nevertheless, our study has limitations, and further research is required to fully understand the broader implications of demographic factors on brain structure and function. One limitation is that our ‘ground truth’ racial labels are imperfect, due to incomplete data in the medical records, limitations in race self-reporting system (e.g. lack or subgroups or multi-racial options leading subjects of certain groups to more frequently select ‘other’), or failure of staff to appropriately inquire about race.^19^ To mitigate this noise in the racial labels, we used robust statistics based on the Laplace distribution, i.e. using medians instead of means (as Gaussians do). However, future work should use cleaner labels, if possible.
Additionally, we did not consider other potentially relevant confounders, such as years of education,^20^ socioeconomic status or the reason for MRI prescription; these were not consistently or reliably available for our retrospective cohort. Furthermore, the reason for MRI prescription is also highly impractical to model, due to its discrete, heterogeneous nature. Future work will assess whether ICV correction also handles these and other confounders. Further, it will be crucial to study a more comprehensive range of finer ROIs (e.g. cortical areas) and populations (e.g. multi-racial), affected by various diseases. Such analyses are needed to ensure that the findings in this study are applicable across even more diverse demographic groups—eventually leading to a more inclusive and equitable understanding of the human brain in health and in disease.
Supplementary Material
fcaf271_Supplementary_Data
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Grover VP, Tognarelli JM, Crossey MM, Cox IJ, Taylor-Robinson SD, Mc Phail MJ. Magnetic resonance imaging: Principles and techniques: Lessons for clinicians. J Clin Exp Hepatol. 2015;5(3):246–255.26628842 10.1016/j.jceh.2015.08.001PMC 4632105 · doi ↗ · pubmed ↗
- 2Johnson KA, Fox NC, Sperling RA, Klunk WE. Brain imaging in Alzheimer disease. Cold Spring Harb Perspect Med. 2012;2(4):a 006213.22474610 10.1101/cshperspect.a 006213 PMC 3312396 · doi ↗ · pubmed ↗
- 3Shenton ME, Dickey CC, Frumin M, Mc Carley RW. A review of MRI findings in schizophrenia. Schizophr Res. 2001;49(1–2):1–52.10.1016/s 0920-9964(01)00163-3PMC 281201511343862 · doi ↗ · pubmed ↗
- 4Meysami S, Raji CA, Merrill DA, Porter VR, Mendez MF. MRI volumetric quantification in persons with a history of traumatic brain injury and cognitive impairment. J Alzheimer’s Dis. 2019;72(1):293–300.31561375 10.3233/JAD-190708 PMC 7680654 · doi ↗ · pubmed ↗
- 5Boublay N, Bouet R, Dorey J-M, et al Brain volume predicts behavioral and psychological symptoms in Alzheimer’s disease. J Alzheimer’s Dis. 2020;73(4):1343–1353.31903989 10.3233/JAD-190612 · doi ↗ · pubmed ↗
- 6Frisoni GB, Fox NC, Jack CR Jr, Scheltens P, Thompson PM. The clinical use of structural MRI in Alzheimer disease. Nat Rev Neurol. 2010;6(2):67–77.20139996 10.1038/nrneurol.2009.215PMC 2938772 · doi ↗ · pubmed ↗
- 7Harris SE, Cox SR, Bell S, et al Neurology-related protein biomarkers are associated with cognitive ability and brain volume in older age. Nat Commun. 2020;11(1):800.32041957 10.1038/s 41467-019-14161-7PMC 7010796 · doi ↗ · pubmed ↗
- 8Abbs B, Liang L, Makris N, Tsuang M, Seidman LJ, Goldstein JM. Covariance modeling of MRI brain volumes in memory circuitry in schizophrenia: Sex differences are critical. Neuroimage. 2011;56(4):1865–1874.21497198 10.1016/j.neuroimage.2011.03.079PMC 3113542 · doi ↗ · pubmed ↗
