From big associations to big practices—Why normative modeling should be the default in personality neuroscience

Peiqian Wu; Yudie Chang

PMC · DOI:10.3389/fpsyg.2025.1701166·November 4, 2025

From big associations to big practices—Why normative modeling should be the default in personality neuroscience

Peiqian Wu, Yudie Chang

PDF

Open Access

Abstract

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases2

bipolar disorder schizophrenia

Figures1

Click any figure to enlarge with its caption.

Sample size required to detect small brain–trait effects (80% power, two-sided α = 0.05). Markers indicate r = 0.02/0.03/0.05. Computed via Fisher-z power analysis.

Funding1

—Anhui Normal University10.13039/501100013461

Keywords

personality neurosciencesmall effect sizesheterogeneitynormative modelingbrain-wide associationreproducibilitybig data

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPersonality Traits and Psychology · Personality Disorders and Psychopathology · Functional Brain Connectivity Studies

Full text

Introduction

Early studies reported enticing correlations between Big Five personality traits and brain structure (DeYoung et al., 2010; Kanai and Rees, 2011). Such findings supported the view that stable traits have identifiable neural substrates. However, over a decade of research revealed a more complicated picture: many reported brain-trait associations failed to replicate (e.g., Boekel et al., 2015), and the field struggled with so-called “voodoo correlations,” in which extremely high correlations were likely spurious (Vul et al., 2009). Subsequent critiques underscored that typical sample sizes were underpowered to detect the small effect sizes realistic for personality–brain links (Button et al., 2013; Dubois and Adolphs, 2016).

Small effects and the need for big data

Comprehensive investigations have found little to no evidence of robust brain–personality associations in large samples. For instance, Avinun et al. (2020) examined 1,107 individuals and reported no significant relationships between Big Five traits and multiple measures of brain morphometry. A systematic review and meta-analysis reached a similar conclusion, summarizing that there are no replicable structural brain differences as a function of Big Five traits (Chen and Canli, 2022). These results suggest that true associations, if present, are extremely small in magnitude. Power analyses and empirical work converge on the same message: detecting such tiny effects requires very large samples. Marek et al. (2022) showed that typical brain-wide association studies require thousands of subjects for reproducibility. Figure 1 illustrates this relationship: as effect size decreases, the sample size needed for 80% statistical power rises steeply. In practice, most historical studies in personality neuroscience were far too small, yielding underpowered analyses and spurious positives (Yarkoni, 2009; Button et al., 2013).

Sample size required to detect small brain–trait effects (80% power, two-sided α = 0.05). Markers indicate r = 0.02/0.03/0.05. Computed via Fisher-z power analysis.

Heterogeneity and the case for normative models

Heterogeneity is a key reason why averaging can obscure meaningful effects: individuals with the same trait score can show different neural patterns, and similar brain measures can accompany different trait expressions. Sex-dependent or subgroup-specific associations have been observed (Nostro et al., 2017), and idiosyncratic variation is the rule rather than the exception. Normative modeling provides a principled solution by estimating the expected distribution of brain measures given covariates (e.g., age, sex) and then characterizing each person by their deviation from that norm (Marquand et al., 2016, 2019). Instead of asking whether trait X correlates with region Y on average, we ask whether individuals with extreme trait values show atypical deviations relative to peers. This person-centered approach has already improved sensitivity in clinical domains, revealing heterogeneous deviation patterns in disorders such as schizophrenia and bipolar disorder (Wolfers et al., 2018). Resources like the Brain Charts for the human lifespan demonstrate how large multi-cohort datasets can be used to build robust normative references (Bethlehem et al., 2022).

From big associations to big practices

Adopting normative modeling as a default approach represents a shift from chasing elusive average effects to embracing big practices—robust analytical habits that match problem complexity. Practically, this entails: assembling adequately powered samples via collaboration and data sharing; deriving individual deviation scores for relevant brain measures; testing preregistered, out-of-sample hypotheses about how deviations relate to personality; and reporting full model performance and failures. This agenda pairs naturally with prediction-oriented analysis (Yarkoni and Westfall, 2017), multivariate methods, and open science. It aligns with the idea that personality is about what makes individuals unique—and our methods should quantify that uniqueness instead of averaging it away.

Conclusion

The era of small-N brain–personality correlation hunting is ending. By defaulting to normative modeling, personality neuroscience can better accommodate small effects and heterogeneity, leverage large datasets to establish meaningful baselines, and identify how and why certain individuals or subgroups deviate. Coupled with adequately powered designs and transparent, predictive workflows, this shift from big associations to big practices promises findings that are more robust, generalizable, and practically meaningful.

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Avinun R.Israel S.Knodt A. R.Hariri A. R. (2020). Little evidence for associations between the Big Five personality traits and variability in brain gray or white matter. Neuro Image 220:117092. 10.1016/j.neuroimage.2020.11709232599267 PMC 7593529 · doi ↗ · pubmed ↗
2Bethlehem R. A. I.Seidlitz J.White S. R.Vogel J. W.Anderson K. M.Adamson C.. (2022). Brain charts for the human lifespan. Nature 604, 525–533. 10.1038/s 41586-022-04554-y 35388223 PMC 9021021 · doi ↗ · pubmed ↗
3Boekel W.Wagenmakers E. J.Belay L.Verhagen J.Brown S.Forstmann B. U. (2015). A purely confirmatory replication study of structural brain–behavior correlations. Cortex 66, 115–133. 10.1016/j.cortex.2014.11.01925684445 · doi ↗ · pubmed ↗
4Button K. S.Ioannidis J. P. A.Mokrysz C.Nosek B. A.Flint J.Robinson E. S. J.. (2013). Power failure: Why small sample size undermines the reliability of neuroscience. Nat. Rev. Neurosci. 14, 365–376. 10.1038/nrn 347523571845 · doi ↗ · pubmed ↗
5Chen Y.-W.Canli T. (2022). “Nothing to see here”: No structural brain differences as a function of the Big Five personality traits from a systematic review and meta-analysis. Personal. Neurosci. 5:e 8. 10.1017/pen.2021.535991756 PMC 9379932 · doi ↗ · pubmed ↗
6De Young C. G.Hirsh J. B.Shane M. S.Papademetris X.Rajeevan N.Gray J. R. (2010). Testing predictions from personality neuroscience: brain structure and the Big Five. Psychol. Sci. 21, 820–828. 10.1177/095679761037015920435951 PMC 3049165 · doi ↗ · pubmed ↗
7Dubois J.Adolphs R. (2016). Building a science of individual differences from f MRI. Trends Cognit. Sci. 20, 425–443. 10.1016/j.tics.2016.03.01427138646 PMC 4886721 · doi ↗ · pubmed ↗
8Kanai R.Rees G. (2011). The structural basis of inter-individual differences in human behaviour and cognition. Nat. Rev. Neurosci. 12, 231–242. 10.1038/nrn 300021407245 · doi ↗ · pubmed ↗