A response to “Realism and robustness require increased sample size when studying both sexes”
Benjamin Phillips, Timo N. Haschler, Natasha A. Karp

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSex and Gender in Healthcare
In our original paper [1], we concluded that inclusion of both sexes does not require an increase in sample size by default. Drobniak and colleagues [2] have helpfully highlighted an exception to this general rule, in cases where the variability for a trait differs between the sexes. In these situations, the statistical analysis needs amending to account for the unequal variance and there is a potential power penalty [2]. We would argue however that the general rule still applies, unless there is empirical evidence on a case-by-case basis that suggests that a more complex bespoke power calculation is required, which can then guide the subsequent required sample size.
Drobniak and colleagues [2] assert that “heteroscedasticity between two sexes should be the norm rather than the exception” and provide 2 arguments for this position. Firstly, they cite 2 meta-analyses. We note that the conclusions, within these papers, do not support the default assumption of heteroscedasticity. Indeed, the opposite position (assumption of homoscedasticity) is better supported. For example: “For the average trait of interest to animal researchers, variability does not differ between males and unstaged females” [3] and “the distributions (diff in CV) [explanation added] were extremely leptokurtic, so that many differences were close to 0” [4]. Furthermore, additional studies, be it clinical data [5], behaviour metrics [6], microarray data [7], or data from rats [8] have found similar results. In summary, the literature supports the position that unless there is evidence to the contrary, assume equal variance.
Secondly, Drobniak and colleagues [2] argue that the variance between the sexes will inevitably differ due to Taylor’s law, where the mean and the standard deviation correlate. Consequently, baseline sex differences will inherently lead to variance differences. It is important to note that, for traits that follow Taylor’s law, then any intervention effect (e.g., drug treatment) will also lead to a change in variance between treatment groups. In these situations, it would be inappropriate to fit a standard factorial analysis and the associated statistical power would be irrelevant. As we stated in our original paper [1], it is important to ensure the assumptions hold for your chosen statistical analysis. In biological situations where the mean and variance correlate, a variance stabilising transformation (e.g., log) is an easy applicable strategy to try which should improve the characteristics and allows the application of a standard statistical strategy.
In conclusion, we believe that our original advice still holds. If you have no specific knowledge to the contrary, it is reasonable to proceed assuming homoscedasticity, with no need to increase the overall sample size. If evidence of sex-related heteroscedasticity in the measure of interest is apparent or later emerges, it is important to adapt the statistical analysis plan and power calculation accordingly. As inclusion of both sexes increases, we will start to have the data to conduct more nuanced planning.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Phillips B, Haschler TN, Karp NA. Statistical simulations show that scientists need not increase overall sample size by default when including both sexes in in vivo studies. P Lo S Biol. 2023;21(6):e 3002129. doi: 10.1371/journal.pbio.3002129 37289836 PMC 10284409 · doi ↗ · pubmed ↗
- 2Drobniak SM, Malgorzata Lagisz M, Yang Y, Nakagawa S. Realism and robustness require increased sample size when studying both sexes. P Lo S Biol. 2024; 22(4): e 3002578. doi: 10.1371/journal.pbio.3002578 PMC 1100879338603525 · doi ↗ · pubmed ↗
- 3Prendergast BJ, Onishi KG, Zucker I. Female mice liberated for inclusion in neuroscience and biomedical research. Neurosci Biobehav Rev. 2014;40:1–5. doi: 10.1016/j.neubiorev.2014.01.001 24456941 · doi ↗ · pubmed ↗
- 4Wyman MJ, Rowe L. Male bias in distributions of additive genetic, residual, and phenotypic variances of shared traits. Am Nat. 2014;184(3):326–337. doi: 10.1086/677310 25141142 · doi ↗ · pubmed ↗
- 5Halsey LG, Esteves GP, Dolan E. Variability in variability: does variation in morphological and physiological traits differ between men and women? R Soc Open Sci. 2023;10(9):230713. doi: 10.1098/rsos.230713 37680495 PMC 10480696 · doi ↗ · pubmed ↗
- 6Kaluve AM, Le JT, Graham BM. Female rodents are not more variable than male rodents: A meta-analysis of preclinical studies of fear and anxiety. Neurosci Biobehav Rev. 2022:104962. doi: 10.1016/j.neubiorev.2022.104962 36402227 · doi ↗ · pubmed ↗
- 7Itoh Y, Arnold AP. Are females more variable than males in gene expression? Meta-analysis of microarray datasets. Biol Sex Differ. 2015;6:18. doi: 10.1186/s 13293-015-0036-8 26557976 PMC 4640155 · doi ↗ · pubmed ↗
- 8Becker JB, Prendergast BJ, Liang JW. Female rats are not more variable than male rats: a meta-analysis of neuroscience studies. Biol Sex Differ. 2016;7(1):1–7.27468347 10.1186/s 13293-016-0087-5PMC 4962440 · doi ↗ · pubmed ↗
