Gender “in the wild”: toward a person-specific behavioral neuroendocrinology

Christel Portengen; Esmeralda Hidalgo-Lopez; Ran Yan; Adriene M. Beltz

PMC · DOI:10.1186/s13293-026-00839-3·February 13, 2026

Gender “in the wild”: toward a person-specific behavioral neuroendocrinology

Christel Portengen, Esmeralda Hidalgo-Lopez, Ran Yan, Adriene M. Beltz

PDF

Open Access

TL;DR

This paper argues for studying individual differences in sex and gender effects on behavior using personalized data from daily life observations.

Contribution

The novelty lies in using intensive longitudinal data and idiographic analyses to study the interplay of sex-related neuroendocrinology and gender-related self-concepts in unique individuals.

Findings

01

Mean-based analyses may obscure individual differences in sex and gender effects on behavior.

02

Idiographic approaches reveal unique relations between neuroendocrinology and self-concepts in daily life.

03

Intensive longitudinal data with up to 100 daily assessments can capture individualized patterns.

Abstract

Sex- and gender-related contributions to behavior “in the wild”, as observed in humans in the natural context of their daily lives, can vary strikingly across individuals and be highly enmeshed – so much so that it is impossible to determine whether an average difference between women and men, for instance, reflects biological or sociocultural factors, respectively. Indeed, empirical insights may not just be limited, but may even be distorted, if study designs and data analyses continue to place unique people in ill-assumed homogenous groups for mean-based calculations. Findings may ultimately generalize to no one. An idiographic, or personalized, approach, however, reveals the intricate ways in which sex-related characteristics, such as gonadal hormones, and gender-related experiences combine to matter for behavior. This approach often requires novel data, that is, many repeated…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Figures3

Click any figure to enlarge with its caption.

Daily femininity (purple) and masculinity (yellow) scores for six illustrative women in different stages of the menopause transition across 100 days. Dashed lines indicate the intraindividual means (i*M*s), and shaded areas indicate ± 1 intraindividual standard deviation (i*SD*). The left column shows pre-menopausal (top), peri-menopausal (middle), and post-menopausal (bottom) participants with relatively low variability in gender expression (i*SD*s < 0.21). The right column depicts pre-menopausal (top), peri-menopausal (middle), and post-menopausal (bottom) participants with relatively high v

Daily plots (left) and regression plots (right) for two women with varying hormone milieus: One was naturally cycling (2A) and the other was using a triphasic oral contraceptive (2B). The left panels depict femininity (purple) and verbal recall (green) across 75 days, with grey shading indicating days on which the participant was bleeding (2A) or in their inactive pill phase (2B). The right panels (with jitter added for visualization) depict results of the simple slopes analyses, following significant intraindividual interactions between daily femininity and bleeding status on verbal recall (s

Funding3

—https://doi.org/10.13039/100000913James S. McDonnell Foundation
—https://doi.org/10.13039/100005993Institute for Research on Women and Gender, University of Michigan
—https://doi.org/10.13039/501100003986Jacobs Foundation

Keywords

DepressionHormonal contraceptivesIdiographicIntraindividual variationMenopauseMenstrual cyclePubertySensation seekingSex differencesVerbal skills

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSex and Gender in Healthcare · Hypothalamic control of reproductive hormones · Neuroendocrine regulation and behavior

Full text

Introduction

Sex is an often imprecise yet meaningful marker of multifaceted biological processes, including aspects of anatomy (e.g., gonads), genetics (e.g., chromosomes and transcription factors), and hormones (e.g., testosterone and estradiol), which can fluctuate or change in appearance, activity and levels, as well as expression across days (e.g., diurnal rhythm of testosterone), months (e.g., menstrual cycle), and even the life course (e.g., development of secondary sex characteristics associated with puberty) [1, 2]. It has factored prominently in behavioral neuroendocrinology; there are countless special issues, review articles, and meta-analyses delineating average differences between males and females, including the extent to which they are influenced by androgens and estrogens [3–7]. It is hard to understate the value of recent sex differences research: In the United States, women were disturbingly underrepresented in clinical trials prior to a National Institutes of Health (NIH) mandate in 1993, and female animals were largely excluded from pre-clinical research prior to a 2016 NIH mandate [8–11]. Thus, only in the past 30 years has there been some movement away from a science informed almost exclusively by male minds and mannerisms, and equitably toward one that includes female physiques and phenotypes.

Despite the consequential impact of sex differences research for the neuroendocrine health and wellbeing of all people, especially of girls and women [12, 13], fervent academic critics have emerged [14–16]. Nonetheless, average sex differences can be meaningful, marking variability in biological processes that are ultimately important for what makes each person unique. Sex differences even vary in the processes they mark in non-human species, as androgens from the testes do not drive sexual differentiation in all species (e.g., hyenas), there is evidence for male pregnancy in non-mammalian species (e.g., seahorses), and sex chromosome profiles interplay with neuroendocrine processes in white-throated sparrows, resulting in brighter coloration and song in females than in males [17–19]. Clearly, sex differences are not sexist, deterministic, or unmodifiable [20, 21].

Moreover, for humans “in the wild,” that is, as they go about their everyday experiences in their unique environments, gender undoubtedly contributes to the meaning of sex differences. According to the NIH (prior to January 20, 2025) and the European Sex and Gender Equity in Research (SAGER) guidelines, which direct significant scientific funding and publishing portfolios, sex (female/male with variation) is a biological construct, and gender (spectra potentially including man/woman) is a sociocultural one [1, 22]. Gender reflects individual characteristics (e.g., identity), is linked to socialization (e.g., norms), is constrained by exposures (e.g., language), and unfolds in dynamic political contexts that influence the aforementioned expression of personal characteristics and social experiences [23–25]; certainly, “gender” in the United States is different in June 2025 than it was in December 2024.1 Even in animals, sex interacts with environmental exposures and experiences to modify phenotypes, ensuring that biology is not destiny [21]. For instance, temperature determines sexual maturation in some reptilian species (e.g., alligators), and rodent maternal behavior in early life (e.g., licking) influences offspring gene expression and behavior [26, 27].

Sex and gender play vital roles in the complexities of human behavior, and their contributions are wonderfully integrated and conflated (see also [28, 29]); thus, the unequivocal identification of a sex-related versus a gender-related influence on behavior – let alone an unequivocal “sex” or “gender” difference – is often impossible. This is illustrated elegantly in the multidimensional matrix that conceptualizes gender development [30]. In it, four constructs (i.e., concepts or beliefs, identity or self-perception, preferences, and behavioral enactment) are manifested by six different content areas (i.e., biology, activities and interests, personal-social attributes, social relationships, styles and symbols, and values regarding gender), highlighting how even the theoretical underpinnings of development emphasize the interplay between sex- and gender-related factors. Indeed, this interplay is increasingly seen in the scientific corpus via “gender/sex,” “sex/gender,” and “sexually polymorphic” terminology [31, 32], or through explications of “gendered” as an adjective reflecting a complex mix of biological and psychosocial influences on human behavior [33].

Variability in sex- and gender-related factors

As dynamic, multifaceted constructs, variability in sex and gender – and the behaviors with which they are associated – comes as no surprise. Yet, the scientific corpus is overrepresented by mean-based analyses of (often binary) sex and gender differences in behavior. In such analyses, scores on a dependent or outcome variable from men, for example, are averaged and then subtracted from the average score of women; this is the between-group difference. That difference is then divided by a term combining how much men vary from their group average and how much women vary from their group average, incorporating sample size when appropriate; this is the within-group variability. If the between-group difference is large with respect to the within-group variability, then a statistically significant gender difference has been found.

There have been criticisms of this mean-based approach, as it has both commanded and constrained sex and gender science. In fact, there have been explicit calls for increased attention to within-group variability and to the extent of distributional overlap between the scores from men and women (for example [31, 34]). Yet, most statistical indicators of within-group variability rely on group averages. The standard deviation is a common indicator of variability, and it is calculated by first subtracting each person’s score from their group mean. Moreover, between-person, or nomothetic, analyses that average across individuals in order to generalize beyond the sample assume that individuals in a group are homogenous and do not vary over time [35–37]. The calculation of within-group variability and assumptions of nomothetic analyses raise the crucial question: How is a “group” defined for highly variable constructs like sex and gender: those with the same sex assigned at birth, same chromosome complement, same external genitalia, same qualitative gender identity, same rearing environment, same masculine and feminine expression, or some combination of these factors? Although important findings about sex disparities and gender differences have been – and still are to be – uncovered using a mean-based approach, it is high time to recognize that assumptions of homogeneity are oftentimes untenable and that the subsequent utility of between-person analyses are limited in modern sex and gender science.

This conclusion is mathematically supported by the ergodic theorem, which details the conditions that must be met for results from nomothetic studies of between-person variation to generalize to individuals [36]. These conditions are met when often cross-sectional group statistics (e.g., means, variances, and covariances) correspond to the statistics from repeated assessments of an individual over time; this correspondence occurs when individuals in a group do not differ from each other nor change over time, conditions unlikely to occur for most human behaviors related to sex and gender [35–37]. Thus, results from nomothetic studies on sex- and gender-related behaviors likely violate the tenets of the ergodic theorem, and fail to generalize to all individuals. To study sex and gender at the individual-level, an idiographic, or person-specific approach is needed. This approach derives statistical results unique to each person based on their within-person variation – like a quantitative case study. Results from idiographic studies are not intended to generalize across people, but rather, to future timepoints of the same person [35–37].

The multifaceted variability – or heterogeneity – in sex and gender thus requires an idiographic approach, to elucidate which factors matter when, how, and for whom, following the ergodic theorem. It leverages intensive longitudinal data, or a large number of observations on the same variables from the same person, such that there is enough information to fit a statistical model to a single person’s data, as if they were the only participant in a study of N = 1 (and Time, T = many). Intensive longitudinal data are increasingly common in the biological and social sciences; they come in the form of functional neuroimaging scans, wearable passive sensors, ecological momentary assessments, and daily diaries, among other densely repeated measures [38, 39]. There are also a host of techniques for analyzing intensive longitudinal data, ranging from the mass univariate approach to classic timeseries methods to state-of-the-art temporal network analyses and machine learning algorithms, though not all operate at the N = 1 level [40, 41].

Illustrating the promise of an idiographic approach for sex and gender science

The goal of the current work is to illustrate the promise of an idiographic approach for the integration of sex- and gender-related factors in the day-to-day lives of heterogenous individuals; the goal is not to make nomothetic inferences, but rather, to accurately capture the lived experiences of individuals. Specifically, sex- and gender-related variables from select individuals who participated in three different intensive longitudinal studies were analyzed utilizing three different idiographic analytic approaches. All three studies were daily diaries in which individuals responded to a series of questionnaires about their experiences, thoughts, and behaviors in the past 24 h and completed a set of cognitive tasks at the end of each day for 75 or 100 days.

In each illustration, the investigated sex-related factor marked neuroendocrine modulation at a crucial period of life: the menopause transition during aging, the menstrual cycle and oral contraceptive use in young adulthood, and pubertal development in adolescence. These modulations provide indirect indications of neuroendocrine processes that may otherwise be difficult, imprecise, or even impossible to study in humans. For instance, bleeding during the menstrual cycle indicates that ovarian hormone levels are relatively low without requiring repeated blood draws [42], and assessment of secondary sex characteristic development associated with puberty reflects not only hormone levels, but also receptor presence and sensitivity [43, 44]. They serve as “natural experiments”, a widely valued approach in behavioral neuroendocrinology [21, 45, 46].

In all illustrations, the investigated gender-related factor was self-reported gender expression, or daily self-perceptions of masculinity and femininity [47], which concern how individuals enact their gender. It is separate from, but can overlap with, identity as well as other gender-related factors, such as personality, norms, and current social setting [48–50]. This construct was selected because it has been shown to vary across people and contexts in past intensive longitudinal studies, including those using data from illustrations 2 and 3 below [50–53]. The self-perceived femininity and masculinity constructs were assessed daily by the Sex Role Identity Scale (SRIS), modified for daily administration [47]. Specifically, participants rated six items on a scale from 1 (Not at all) to 5 (Extremely) regarding how masculine and then how feminine: (1) they acted, appeared, or came across that day; (2) their personality was that day; and (3) they thought they were in general that day. The terms “feminine” and “masculine” were not defined, as the goal was not to compare constructs across individuals, but rather, to detect variation across study days in whatever the constructs mean to a unique participant. This daily measure has evidenced good-to-excellent within-person (and between-person) reliability in past studies for both femininity and masculinity and for a bipolar gender expression continuum, with findings indicating that the relation between masculinity or femininity and wellbeing domains (e.g., depressive symptoms) varies across samples and likely across contexts and individuals [51, 52].

The three illustrations are presented in order of increasing analytic complexity, and the second and third illustrations include daily assessments of other variables with established sex- or gender-related links (i.e., verbal recall, depressive symptoms, and sensation seeking). Assessment of these variables is discussed within each relevant illustration.

Illustration 1: Fluctuations in daily femininity and masculinity across the menopause transition

Menopause occurs around age 51 years (on average), and is defined as the absence of menses for 12 consecutive months, marking a significant decline in ovarian hormone production in female adults through natural aging [54, 55]. Prior to menopause, however, there can be significant ovarian hormone fluctuation during peri-menopause, which can last up to seven years. This period, which marks the transition from pre-menopause to menopause, is often accompanied by menstrual cycle irregularities, cognitive symptoms, sleep disturbances, and vasomotor symptoms [56]. Thus, the menopause transition reflects a period of significant variability and heterogeneity – both between and within individuals – in neuroendocrine function and its presumed sequelae.

The first illustration explores the person-specific ways in which daily femininity and masculinity fluctuate across the menopause transition. Data were drawn from six women, two in each of three different stages of menopause, who participated in an ongoing 100-day intensive longitudinal study [57] and had response rates of at least 80%, an emerging field standard [37, 58]; the presented data are novel. Specifically, for each menopausal stage, one participant with relatively low, but non-zero, variability in gender expression and one with relatively high variability in gender expression were considered. Menopause status was determined through self-report, and the SRIS (described above) was used to assess daily femininity and masculinity. Separately for each participant, composites were created for each day by averaging the three femininity items and the three masculinity items, and then intraindividual means (iMs) and intraindividual standard deviations (iSDs) were calculated for each. The iM is each individual’s average masculinity or femininity score across 100 days, and the iSD is each individual’s variability around their own iM, with low values reflecting consistent gender expression across study days, and high values reflecting fluctuating expressions across study days.

Illustrative results are shown in Fig. 1, organized by menopause status (rows) and degree of gender fluctuations (columns), and highlight the person-specificity of fluctuations in self-perceived gender expression regardless of menopausal status. Each plot shows study day on the x-axis and gender expression on the y-axis, with daily femininity (purple) and masculinity (yellow) plotted along with iMs (dashed lines) and iSDs (shaded background). Notice that for all women, femininity iMs were higher than masculinity iMs, but they were more disparate for some women (e.g., 1A, 1C, and 1E) than others (e.g., 1B, 1D, and 1F). Also, notice that some women reported little variation in feminine and masculine expression (left column), with 1B and 1C only reporting deviations from their modal femininity scores on 2 of 100 days. Yet, for other women (right column), both masculinity and femininity showed considerable fluctuations across 100 days, meaning that their gender expression waxed and waned, potentially linked with sex- and gender-related factors that can be investigated in future work. Importantly, this means that women with high iSDs could have iMs that may be misleading, as their average levels of self-perceived femininity and masculinity do not describe their daily experiences equally well. For example, although all women in Fig. 1 had femininity iMs greater than their masculinity iMs, for two women (1D and 1F) there were days when their self-perceived masculinity was actually higher than their self-perceived femininity. Thus, these data highlight gender-related individualization through a hormone transition, with noticeable heterogeneity in intraindividual reports of gender expression (marked by the iSD) for unique women experiencing different neuroendocrinological changes surrounding menopause.Fig. 1. Daily femininity (purple) and masculinity (yellow) scores for six illustrative women in different stages of the menopause transition across 100 days. Dashed lines indicate the intraindividual means (iMs), and shaded areas indicate ± 1 intraindividual standard deviation (iSD). The left column shows pre-menopausal (top), peri-menopausal (middle), and post-menopausal (bottom) participants with relatively low variability in gender expression (iSDs < 0.21). The right column depicts pre-menopausal (top), peri-menopausal (middle), and post-menopausal (bottom) participants with relatively high variability (iSDs > 0.50)

Illustration 2: Menstrual cycle modulation of person-specific links between femininity and verbal skills

Ovarian hormones show considerable fluctuations across the menstrual cycle in young adulthood: Estradiol and progesterone are lowest during menstruation and peak in the late follicular and midluteal phases, respectively [42]. Oral contraceptives (OCs) contain synthetic hormones that often suppress ovarian function and its endogenous hormone production [33, 59]. Although formulations vary, most OCs consist of a synthetic estrogen (e.g., ethinyl estradiol) and a synthetic progestin (e.g., levonorgestrel, norethindrone). Generally, OCs contain between 21–28 active pills, and 4–7 inactive (placebo) pills that prompt withdrawal bleeding. This means that people who are naturally cycling and people using OCs experience declines in their endogenous or exogenous hormone levels, respectively, when they are menstruating or in their inactive pill phase.

Past cross-sectional work has linked endogenous and exogenous ovarian hormones, specifically estradiol, to verbal recall (e.g., through menstrual cycle phase or OC use), but not all studies report significant links [45, 60, 61]. Moreover, there are long-standing hypotheses about femininity underlying the sex difference in verbal recall, as females outperform males on average [62, 63]. Combined with the extensive heterogeneity in ovarian hormone levels, sensitivities, and OC formulations, as well as in gender expression, this scenario raises important questions about whether and how sex- and gender-related factors might interact to explain verbal recall – in different ways for different people.

Thus, the second illustration highlights how within-person changes in ovarian hormone exposure (marked by menstrual bleeding or the inactive pill phase) modulate the person-specific link between femininity and verbal recall. Data were from one naturally cycling woman and one OC user (of a triphasic formulation containing ethinyl estradiol and different doses of the progestin norgestimate), who both participated in a 75-day intensive longitudinal study. Data from this study on daily verbal recall and gender expression (among other constructs) have been published separately (see, e.g., [53, 64, 65]); however, the association between expression and recall has not been previously considered, let alone in relation to hormonal milieu. Participants with at least 80% response rates, regular menstrual cycles or consistent OC use across the study, and fluctuations in daily verbal recall and daily femininity were considered. Each day, both participants reported whether they were bleeding, and the OC user additionally reported whether she had taken an active or inactive pill. This was used to index hormone milieu as low (bleeding or inactive pill use = 1) or high (not bleeding or active pill use = 0). Self-perceived femininity was assessed daily with the SRIS by averaging responses on the 1-to-5 Likert scale of the three corresponding items. Verbal recall was assessed with a daily delayed test [64]. In brief, participants were presented with 5 word pairs (each for 2 seconds) at the beginning of each daily survey, and at the end of the survey (~ 15 min later), they were shown the first word and tasked with typing the corresponding second word into an open text box. They received one point for each correct response, totaling a possible 5 points each day.

To examine neuroendocrine modulation of the link between femininity and verbal recall, an intraindividual residualized linear regression was run, separately for each woman. This is essentially a regression analysis in which each study day is an observation (instead of each person being an observation, as would be the case in a traditional between-person analysis), while accounting for autoregressive relations (the previous day variable predicting itself) by using residuals. Specifically, daily verbal recall (outcome) was associated with the daily predictors of femininity, hormonal milieu (indexed by bleeding status for the naturally cycling woman and pill phase for the OC user), and their interaction. Previous studies have used similar approaches to capture intraindividual correlations between gender expression and daily behaviors (e.g., [51–53]).

Illustrative results are shown in Fig. 2 for two participants whose regressions yielded significant interactions, one naturally cycling woman (top, 2A) and one OC user (bottom, 2B). The left plots depict the observed data with day on the x-axis and femininity (purple line) and verbal recall (green line) on the y-axis; the shading indicates days on which the naturally cycling woman was bleeding or the OC user was taking an inactive pill. As seen in the top left, the naturally cycling woman showed moderate day-to-day variability in her self-perceived femininity and verbal recall, generally scoring in the upper half of both scales. Comparatively, the OC user (bottom left) displayed more day-to-day fluctuations in verbal recall and had higher daily scores for self-perceived femininity, primarily fluctuating between 4 and 5.Fig. 2. Daily plots (left) and regression plots (right) for two women with varying hormone milieus: One was naturally cycling (2A) and the other was using a triphasic oral contraceptive (2B). The left panels depict femininity (purple) and verbal recall (green) across 75 days, with grey shading indicating days on which the participant was bleeding (2A) or in their inactive pill phase (2B). The right panels (with jitter added for visualization) depict results of the simple slopes analyses, following significant intraindividual interactions between daily femininity and bleeding status on verbal recall (see Table 1). Black lines reflect the slopes of the femininity-verbal recall relation during high hormone milieus, and grey lines reflect the slopes of the femininity-verbal recall relation during low hormone milieus

Importantly, both women showed a significant interaction between ovarian hormone milieu and femininity on verbal recall. The right plots depict the simple slope analyses decomposing the significant interactions reported in Table 1 from the person-specific regression models, that is, the relation between femininity (x-axis) and verbal recall (y-axis) separately for bleeding/inactive pill days (grey) and not bleeding/active pill days (black). For the naturally cycling woman (top right), there was an inverse relation between self-perceived femininity and verbal recall on days she was bleeding, but a slightly positive relation on days she had higher ovarian hormone levels (i.e., when she was not menstruating). Conversely, the OC user (bottom right) had a positive relation between femininity and verbal recall on days she was using an inactive pill, but an inverse relation on days when she had higher exogenous hormone exposure (i.e., when she was using an active pill). If averaged together, the opposite effects in these two participants may cancel out, potentially helping to explain the small and mixed findings concerning sex- and gender-related contributions to verbal recall in the literature [45, 60, 61]. Idiographic approaches can, thus, advance the field of neuroendocrinology by identifying individuals, such as these, with potential sensitivities to varying hormonal milieus.Table 1. Regression coefficients from the intraindividual residualized regressions estimating the interaction of self-perceived femininity and bleeding status on delayed verbal recall for one naturally cycling female and one oral contraceptive user in a 75-day intensive longitudinal studybSEpNC femaleFemininity0.170.210.411Bleeding status0.020.200.906Bleeding statusfemininity-0.920.430.038OC userFemininity-1.200.450.009Pill phase-0.130.310.673Pill phasefemininity2.300.950.019NC = naturally cycling. OC = oral contraceptive. SE = standard error. Not menstruating and inactive pill phase were the reference groups for bleeding status and pill phase, respectively

Illustration 3: Personalized networks of daily masculinity, femininity, depressive symptoms and sensation seeking during puberty

Puberty is a salient biological process that unfolds in the complex psychosocial context of adolescence. It involves a growth spurt in height, adrenarche, and gonadarche [43, 44]. Adrenarche occurs around ages 6–8 years; it is the maturation of the adrenal glands, resulting in sharp increases of androstenedione as well as dehydroepiandrosterone (DHEA) and its sulfate (DHEAS). It also leads to the development of secondary sex characteristics, such as pubic and axillary hair [43, 66]. Gonadarche has a wider age range of development, beginning around age 9 for females and ages 10–11 for males, and lasts several years; it is the maturation of the gonads, resulting in increased estradiol, progesterone, and menstruation in females and in increased testosterone in males. It also leads to the development of secondary sex characteristics, such as breasts in females and facial hair in males [43, 66].

Puberty marks the beginning of adolescence, a period of increased risk and resilience to psychopathology, and a time when notable gender disparities emerge [67–70]. Adolescent girls are almost twice as likely as boys to develop depression [71], whereas adolescent boys tend to exhibit higher sensation seeking, particularly by late adolescence and continuing into emerging adulthood [72, 73]. Gender expression is also linked to adolescent adjustment, including to depressive symptoms and sensation seeking behaviors [74, 75]. Although between-person studies suggest that cisgender girls with feminine expression and cisgender boys with masculine expression have reduced symptoms, a recent person-specific study explicates that this is not the case for all adolescents, with ~ 53% of modern youth reporting no daily association between gender expression and depressive symptoms [52]. If associations between gender expression and adjustment are heterogenous across adolescents, then an idiographic approach is seemingly mandatory when incorporating the multifaceted neuroendocrine aspects of puberty.

Thus, the final illustration highlights the individualized ways in which daily masculinity and femininity are interrelated with daily depressive symptoms and sensation seeking in four adolescents: two 13-year-olds (one girl and one boy) of similar pubertal stage and two 16-year-old twin brothers. These participants are selectively presented because their matching characteristics (i.e., pubertal status and shared genes plus rearing environments, respectively) provide some study control over extraneous influences and alternative explanations. The data come from a 100-day intensive longitudinal study [52, 76]; data from this study concerning person-specific links between gender expression and depressive symptoms (among other constructs) have already been published, but expression and depression have not been considered in a temporal network analysis [52, 76]. Pubertal status was assessed at the beginning of the study with the self-reported Pubertal Development Scale (PDS; [77]), which contains five items about the development of secondary sex characteristics rated on a 1 (No development) to 4 (Complete development) scale; responses to all items were averaged. The two 13-year-old youth matched on pubertal status had average PDS scores of 2.4, and the 16-year-old twin brothers had the same average PDS score of 2.8. Self-perceived masculinity and femininity were assessed daily with the SRIS using separate composites from the 1-to-5 response scale. Daily depressive symptoms were assessed with an adapted version of the 13-item Short Mood and Feelings Questionnaire (SMFQ; [78]) in which adolescents reported on their feelings that day (e.g., “I did not enjoy anything at all”) on a scale from 0 (Not true) to 2 (True); composites were created by averaging the responses from each adolescent each day, with higher scores reflecting more symptoms. Daily sensation seeking was assessed using an adapted version of the 8-item sensation seeking subscale of the UPPS-P for children [79] in which adolescents reported on the ways they acted and thought in the past 24 hours (e.g., “I wanted new, thrilling things to happen today”) on a scale from 1 (Not at all like me) to 4 (Very much like me); composites were created by averaging the responses from each adolescent each day, with higher scores reflecting higher sensation seeking tendencies.

To examine the daily interplay among the four gender expression and adjustment variables unique for each adolescent, person-specific temporal networks were estimated by fitting individual-level unified structural equation models [80, 81]. Using this approach, a sparse data-driven network was estimated for each participant by iteratively adding optimal (i.e., greatest improvement to model fit according to Lagrange multiplier tests) directed contemporaneous (same-day) and lagged (next-day) relations between pairs variables to the model until it fit well according to established indices (root mean square error of approximation [RMSEA], root mean square residual [SRMR], comparative fit index [CFI], and non-normed fit index [NNFI]) [82]. Autoregressive relations (i.e., a variable’s one-day lagged relation with itself) were estimated for model performance [83], and a hybrid implementation was used, allowing variable residuals to bidirectionally correlate [81], which is relevant when relations may occur through a latent exogenous construct (e.g., masculinity and femininity through gender expression).

The personalized networks for the two pairs of adolescents are shown in Fig. 3, with the results for the 13-year-old girl on the top left (3A), the 13-year-old boy on the right (3B), and the two 16-year-old twin boys on the bottom (3C-D). The networks fit the data well (see the Fig. 3 caption) and display overlap between pairs as well as individuality, as none of the relations between variables are seen in all four person-specific networks. Indeed, notice that each adolescent has their own network with four nodes, aligning with the four daily variables in the analysis. Also, notice that the variables can be associated in three different ways – contemporaneously (solid black lines), lagged (dashed black lines), or with correlated residuals (dotted blue lines) – and that each relation has a beta weight, reflecting its direction (positive or negative) and standardized magnitude.Fig. 3 Person-specific networks of daily masculinity, femininity, depressive symptoms, and sensation seeking for four adolescents who are in mid-puberty: two 13-year-old adolescents (PDS = 2.4) and two 16-year-old male twins (PDS = 2.8). Networks were estimated using hybrid-GIMME without the grouping algorithm. Networks include contemporaneous same-day relations (solid black lines), lagged next-day relations (dashed black lines), and covarying residuals (dotted blue lines) between variables, with standardized relation weights. Each network is unique to that individual and fits the data well: 3A: χ2(14) = 19.04, p = 0.164, RMSEA = 0.060, SRMR = 0.077, NNFI = 0.960, CFI = 0.980; 3B: χ2(12) = 13.29, p = 0.348, RMSEA = 0.033, SRMR = 0.049, NNFI = 0.989, CFI = 0.995; 3C: χ2(11) = 11.71, p = 0.386, RMSEA = 0.026 SRMR = 0.044, NNFI = 0.986, CFI = 0.995; 3D: χ2(12) = 12.76, p = 0.389, RMSEA = 0.025, SRMR = 0.044, NNFI = 0.989, CFI = 0.995

Regarding the 13-year-old adolescents (3A and 3B), both had a contemporaneous link between depression and masculinity, but the links are in opposite directions, with daily increases in masculinity associated with increased symptoms for the girl, but decreases in masculinity associated with increased symptoms for the boy. Both 13-year-old adolescents also had same-day associations between their self-perceived masculinity and femininity; on days they felt more masculine, they also felt less feminine, albeit to differing degrees (based on the magnitude of the relations). The girl (3A) had additional lagged relations of sensation seeking with femininity and with depressive symptoms in her network; lower femininity preceded higher sensation seeking the next day, which in turn, preceded lower depressive symptoms the next day. Interestingly, the boy (3B) had a network with predominantly same-day associations (e.g., of femininity with masculinity and sensation seeking, and of depressive symptoms with masculinity). Only one lagged relation was observed in his network: higher masculinity preceded higher sensation seeking the next day. Importantly, his depressive symptoms and self-perceived femininity also had correlated residuals, potentially indicating that a latent construct might underlie both his daily femininity and depression.

Regarding the twin brothers (3C and 3D), they had several of the same relations: correlated residuals between femininity and sensation seeking as well as relations of depressive symptoms with femininity and with sensation seeking, though the temporal nature and direction differed between brothers. For instance, participant 3C had a pair of positive lagged relations between femininity and depressive symptoms, whereas participant 3D had a negative same-day relation between the variables, such that he concurrently reported higher femininity and lower depressive symptoms. Beyond these relations, participant 3C had correlated residuals between masculinity and sensation seeking and between depressive symptoms and sensation seeking, suggesting that a latent factor might underlie these constructs for this participant. In contrast, participant 3D had an additional lagged relation between masculinity and femininity, with higher masculinity preceding lower femininity the next day, as well as correlated residuals between masculinity and depressive symptoms.

The similarities and differences between the network relations of these two pairs of teens might provide insight into the wonderful integration of some aspects of sex and gender. Specifically, the matching of the teens on age (3A and 3B) and comparing the networks of two twin boys with shared genes and rearing environments (3C and 3D) may contribute to the similarities detected in their networks (e.g., inverse relations between masculinity and femininity; correlated residuals between femininity and sensation seeking), whereas the sex-related nature of pubertal development could contribute to the differences observed between the two 13-year-old teens (e.g., relations between femininity and depressive symptoms and masculinity and sensation seeking). The nuanced differences between the networks of the two twin brothers with respect to daily masculinity showcases the potentially unique ways in which gendered self-concepts may interplay with aspects of mental health. Exploring how these networks evolve over the course of puberty would be a fruitful avenue for future research. This could include studies with burst designs (e.g., three 100-day intensive longitudinal bursts conducted 18 months apart) that capture stability or change in person-specific networks across multiple pubertal stages.

Conclusions

Sex and gender are perennially provocative topics of conversation and research inquiry. Their persistence and pervasiveness undoubtedly reflect their centrality to the human experience for many individuals, and reify the modern necessity of a robust sex and gender science. When variable sex-related factors, such as gonadal hormones, are considered in concert with the multidimensionality of gender, including beliefs, self-perceptions, preferences, and behaviors realized in a range of content areas [30], it becomes evident that an idiographic approach – with many repeated assessments per person that are analyzed in N = 1 statistical models – is among the most promising ways to reveal the person-specific integration of sex and gender, and how this integration often matters for human behavior, including mental health and cognition.

That promise was illustrated in this paper with three examples. Using neuroendocrine-informed intensive longitudinal studies and up to 100 daily assessments of masculine and feminine expression, there were: daily fluctuations in gender expression unique to women irrespective of menopause status; person-specific links between femininity and verbal recall modulated by the menstrual cycle or OC pill phase; and personalized networks of gender expression, depressive symptoms, and sensation seeking for two pairs of adolescents matched on age and pubertal stage (and, for one pair, genes and rearing). These are high fidelity illustrations, utilizing data from participants with excellent daily response rates in an array of intensive longitudinal studies that intentionally indexed hormonal milieu and gender expression as well as sophisticated time series analytic approaches [38–41].

These illustrations emphasize that studying aspects of sex and gender as they dynamically unfold “in the wild” of individuals’ everyday lives requires thoughtful consideration of biopsychosocial constructs as well as of study design and analysis. The design and analytic approach usually depends on the nature of the constructs [84]. Nomothetic approaches can be appropriate for constructs that are homogenous across people and stationary over time; between-person analyses and population generalizations are warranted. Person-centered approaches (e.g., cluster and profile analyses) can be ideal when constructs contain distinct, homogenous person- or time-based clusters within a sample. Idiographic approaches are needed when constructs evidence heterogeneity across people or time (i.e., when the studied process is non-ergodic). Idiographic approaches can ultimately inform theory by revealing how, when, and why sex- and gender-related processes vary across people and time, hopefully leading to more accurate and nuanced theories [37, 84]. They can also hopefully lead to improved prevention, intervention, education, and attainment for sex- and gender-related conditions, experiences, and outcomes. In that way, idiographic approaches align with efforts in precision medicine and mental health care as well as in personalized education and coaching [85–87]: They are not likely needed in every case or context, but they have underappreciated promise for complementing nomothetic and person-centered approaches when averages do not apply to individuals.

These illustrations – and the ideographic approach – also have limitations. They are not comprehensive representations of the datasets from which the participants were drawn, making it unclear if findings generalize to other individuals. Indeed, the goal of idiographic science is not to generalize to other people, but rather, to unmeasured past or future observations of the same person. This facilitates accurate description of one person, but nonetheless raises questions about all other people. To help overcome this limitation, novel approaches like group iterative multiple model estimation (GIMME) have been developed [82]. GIMME offers a way to integrate nomothetic and idiographic analyses by estimating person-specific unified structural equation models (the same models presented in illustration 3) that prioritize the inclusion of sample-level and subgroup-level relations if they exist in a dataset; in other words, relations meaningful for the majority of a sample or subgroup (reflecting homogeneity in between-person patterns) are estimated before person-specific relations (80, 88). GIMME has been found to perform extremely well in simulations of heterogenous or complex data types, while providing indications of patterns expected to generalize across people [82, 83, 88].

Also, the illustrations only consider one aspect of gender (i.e., expression) and often of sex (i.e., hormonal milieu). Other (confounding, latent, third) variables might underlie the relations observed in these illustrations (as seen in the correlated network residuals in illustration 3), but even in those cases, accurate idiographic analyses are required before unique underlying variables can be detected.

There are also limitations specific to each of the analytic approaches showcased in the illustrations. The iSDs calculated in illustration 1 do not differentiate between individuals who have consistently small fluctuations around their iMs versus individuals who have few, large deviations from their iM; the person-specific regressions in illustration 2 do not assess directionality; and the personalized networks in illustration 3 assume that relations between variables are constant across study days. Moreover, many additional analyses could be incorporated into the illustrations, including estimating intraindividual correlations among daily masculinity and femininity in illustration 1, estimating directionality in the relations between daily femininity and verbal recall in illustration 2, or adding the GIMME grouping algorithm to the temporal network models in illustration 3 (for details and tutorials, see [35, 82]). Analytic approaches were chosen based on the research questions suitable for the characteristics of each study, meaning that an analytic approach used in one illustration may not be ideal to apply to data from another illustration.

Finally, collection of the 75 and 100-day intensive longitudinal data shared in the illustrations is time-consuming and intensive, but can be optimized through study procedures (e.g. [37]). Relatedly, some studies show no differences in demographics, physical, or mental health between participants with high versus low compliance rates [52, 89, 90], but some differences have been observed in other studies [76].

Nonetheless, these illustrations demonstrate the potential of an idiographic approach to sex and gender science, as they manifest the intricate, individualized integration of a sex-related and a gender-related factor in everyday life for select people. Humans “in the wild” often experience and express their gender in both similar and unique ways across neuroendocrine milieus and time, and in comparison to others. There are many meaningful ways to integrate and advance sex and gender science, and this paper emphasizes that there is no need to conceptually or statistically collapse across heterogenous constructs and people. Instead, heterogeneity can be embraced with a person-specific approach.

Bibliography1

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Harlow SD, Gass M, Hall JE, Lobo R, Maki P, Rebar RW, et al. Executive summary of the Stages of Reproductive Aging Workshop + 10: Addressing the unfinished agenda of staging reproductive aging. Menopause. 2012;19(4).10.1097/gme.0b 013e 31824 d 8f 40PMC 334090322343510 · doi ↗ · pubmed ↗