Early menarche and breast cancer risk: a systematic review and meta-analysis of 74 case–control studies
Fui-Ling Voon, Edmund Ui-Hang Sim

TL;DR
This study finds that starting menstruation before age 13 is linked to a higher risk of breast cancer, with regional differences in the strength of this link.
Contribution
The paper provides an updated meta-analysis of recent studies on early menarche and breast cancer risk since 2014.
Findings
Menarche before age 13 is associated with a 15% increased breast cancer risk (OR = 1.15).
Women with menarche before age 12 had a 27% higher risk (OR = 1.27).
The strongest association was observed in West Asia (OR = 1.70).
Abstract
Early menarche is a known risk factor for breast cancer, as established by the Collaborative Group on Hormonal Factors in Breast Cancer in a 2012 reanalysis of data from forty-seven global epidemiological studies. Given recent changes in the average age at menarche, it is important to re-evaluate this association in the context of the past decade. This meta-analysis aimed to examine the relationship between early menarche, particularly before age 13, and female breast cancer risk by analyzing recently published observational studies and exploring this relationship across different regions of the world. Case–control studies published between January 2014 and February 2025 were systematically searched in PubMed, ScienceDirect, Scopus, and Google Scholar. Early menarche was defined as < 13 years. Pooled odds ratios (ORs) with 95% confidence intervals (CIs) were calculated using…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6- —Sarawak Digital Economy Corporation (SDEC) Bhd.’s Translational Research Grant Programme
- —Universiti Malaysia Sarawak
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBreast Lesions and Carcinomas · Hypothalamic control of reproductive hormones · Cancer Risks and Factors
Introduction
Breast cancer is the most prevalent cancer among women globally. It is a highly diverse cancer, encompassing several distinct subtypes that are categorized into four groups based on hormone receptors, which include estrogen receptor-positive (ER +), progesterone receptor-positive (PR +), human epidermal growth factor receptor 2-positive (HER2 +), and triple-negative breast cancer (TNBC) [1]. Breast cancer is caused by multiple factors [2], as its incidence, mortality, and survival rates differ significantly across regions, potentially due to variations in population demographics, lifestyle, genetic predispositions, and environmental conditions [3]. While screening can help alleviate the impact of breast cancer, it comes with drawbacks such as side effects, over-diagnosis, and higher costs. One approach to developing safer and more cost-effective targeted screening programs is by categorizing women based on their breast cancer risk factors [4]. Among the common risk factors for breast cancer, such as breast cancer gene 1 (BRCA1) and BRCA2 gene mutations [5], older age [6], family history [7], dense breast tissue [8], and so on, an early onset of menarche, or puberty, is considered to be associated with a higher risk of breast cancer due to the increased number of ovulatory cycles over a lifetime, which results in greater exposure to ovarian hormones and thus a higher risk of breast cancer [9, 10].
Menarche is defined as the initial occurrence of menstruation in a female adolescent. According to research conducted in 67 countries and published between the 1960s and 1990s, the mean age at menarche was determined to be 13.53 years, with a standard deviation of ± 0.98 years [11]. In more recent years, menarche manifests between the ages of 10 and 16, with an average onset age of 12.4 years [12]. Menarche signifies the beginning of fertility, while menstruation, occurring roughly every 28 days (ranging from 21 to 45 days) with a mean interval of 32.2 days in the first gynaecologic year, involves the monthly shedding of the uterine endometrial lining, typically lasting three to seven days, with durations over ten days deemed abnormal [13]. Earlier onset of menarche is associated with stressful family environments, foster care, living with a stepparent, urban upbringing, and high socioeconomic status [14]. Dietary factors such as higher animal protein and lower vegetable protein intake [15], as well as the consumption of sugar-sweetened beverages [16], also contribute to earlier menarche. Additionally, being overweight or obese [17], and formula feeding during infancy [18], have been linked to an earlier onset of menarche.
The potential connection between early menarche and increased breast cancer risk has been extensively studied. The Gail model, presented in a 1989 report from the Breast Cancer Detection and Demonstration Project, highlighted the age of menarche as a significant risk factor for breast cancer. According to this model, women who experienced menarche before the age of 12 had a relative risk of 1.21 compared to those who had menarche after age 14 [19]. Nevertheless, a 2009 report from the Nurses’ Health Study, which mitigated the potential bias of enrolling higher-risk women, identified a smaller increase in relative risk of 1.10 for women who experienced menarche before age 12 [20]. A Moroccan case–control study demonstrated a significant association between early menarche (≤ 13 years) and an increased risk of breast cancer [21]. Additionally, a study found that tall women who experienced early menarche (≤ 13 years) had approximately twice the risk of developing estrogen/progesterone-positive (ER + PR +) tumors [22]. Conversely, a cohort study in the United Kingdom observed a lower risk in women whose menarche occurred at 15 years or older compared to those who began menstruating at ages 13–14 [23]. Interestingly, in this cohort study, menarche at 12 years or younger did not show a correlation with increased breast cancer risk [23].
The present meta-analysis investigates the association between age at menarche and the risk of female breast cancer, with particular focus on menarche occurring before age 13. This cut-off was selected based on a review of the literature and examination of data distributions across various populations [11, 24]. Using a PICO framework, the Population consisted of female breast cancer cases and controls from observational studies, the Intervention/Exposure was early menarche (< 13 years), the Comparison was later menarche (≥ 13 years), and the Outcome was breast cancer risk expressed as pooled odds ratios. Guided by this framework, the review addresses one primary question: Does early menarche, compared with later menarche, increase the risk of developing female breast cancer? By summarizing epidemiological evidence published in the last decade, this study aims to provide updated insights that may strengthen current risk assessment models and inform public health strategies, including those targeting childhood nutrition and body weight management.
Methods
Strategy of literature search
The present study adhered to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [25]. Peer-reviewed papers were systematically searched across four online databases: ScienceDirect, PubMed, Scopus, and Google Scholar. Relevant articles were identified from the establishment of each database until 28 February 2025. Medical Subject Headings (MeSH) terms and keywords used in the search included (‘early menarche’ or ‘early puberty’) AND (‘breast cancer’ or ‘breast neoplasms’) AND (‘case–control’ or ‘cohort’ or ‘observational study’), applied to the titles and abstracts of the articles.
The full texts of suitable studies were downloaded for further screening. Duplicate copies were removed, and eligible studies were examined based on the inclusion and exclusion criteria. Additionally, backward citation tracking was conducted to ensure all relevant studies were included [26]. Two authors (FLV and EUHS) independently reviewed the full text of all potentially relevant citations (n = 600). Any disagreements during the screening process were resolved through consensus. After selection, both authors independently reviewed the included articles to extract and tabulate data in spreadsheets, with disagreements again resolved by consensus.
A protocol for this systematic review and meta-analysis was not prospectively registered in PROSPERO or any other protocol registry. However, all methodological steps, which include the search strategy, inclusion and exclusion criteria, and statistical analysis plan, were pre-specified and followed consistently throughout the study.
Selection criteria
All articles identified through the literature search were systematically screened for relevance. Titles and abstracts were initially reviewed against predefined inclusion and exclusion criteria, followed by a full-text assessment of potentially eligible studies.
Inclusion criteria
Studies were included if they met the following criteria: (1) observational studies that included female breast cancer case and control populations; (2) reported data on menarche age categorized as < 13 years (e.g., < 13, 11–12, < 11, etc.); and (3) published as original research articles between 1 January 2014 and 28 February 2025; and (4) written in English.
Exclusion criteria
Studies were excluded if they met any of the following criteria: (1) review articles, meta-analyses, or letters to the editor; (2) did not provide data on menarche age < 13 years; (3) published in languages other than English; (4) contained duplicated or overlapping datasets; and (5) lacked sufficient data to perform statistical analyses.
Only studies published between 1 January 2014 and 28 February 2025 were included. This restriction was applied to capture recent epidemiological data that reflect contemporary trends in childhood nutrition, adiposity, environmental exposures, and age at menarche. Older studies were excluded to avoid mixing earlier birth cohorts whose risk profiles and menarche timing may not represent current populations.
Although cohort studies and clinical trials were retrieved during the screening process, they were excluded because they did not provide extractable raw case–control counts for early versus later menarche. The present meta-analysis required raw 2 × 2 data to calculate crude odds ratios consistently across all studies using the Mantel–Haenszel method. Cohort studies generally report adjusted hazard ratios (HR) or relative risks (RR) without presenting underlying raw counts, which prevented their inclusion without additional statistical assumptions or conversions. Clinical trials were likewise screened and excluded, as none provided case–control comparisons relevant to menarche age.
Data extraction
In this review, “early menarche” was defined as menarche occurring before age 13 (e.g., ≤ 12, < 13, or similar thresholds depending on study categorization). This cut-off was selected based on epidemiological evidence indicating that the global mean age at menarche is approximately 12.4–13.5 years [11, 12], and because many large-scale breast cancer studies and prior meta-analyses apply < 13 years as the definition of early menarche. Although individual studies used different categorical ranges (e.g., ≤ 11, 11–12, ≤ 12), all were standardized into early (< 13) and later (≥ 13) menarche categories for pooled analysis. For consistency, participants with menarche at exactly 12 years were included in the early menarche group, and no participants were excluded on this basis. Age at menarche in the included studies was based on self-reported age at first menstruation, as recorded in each original article, which is consistent with standard epidemiological practice in reproductive health research.
After the initial selection of studies based on the inclusion and exclusion criteria, relevant data were extracted for analysis. The data extracted from every selected study included the following: author, year and country of publication, ethnicity, breast cancer subtypes, study design, sample size for case and control, source of subjects, matching criteria for case and control, and menarche age. For consistency and comparability across studies, raw case and control counts (i.e., the number of cases and controls with and without early menarche) were used as the primary data source. This approach was chosen because raw counts allow for a standardized calculation of crude odds ratios (ORs) across studies, ensuring uniformity in effect size estimation. By using raw counts, we minimize the heterogeneity introduced by different reporting formats (such as adjusted odds ratios or hazard ratios), allowing for a more consistent pooling of data in the meta-analysis. The use of raw counts ensures that the analysis is based on the most fundamental data provided in the studies, thereby reducing potential biases associated with adjustments for confounders that vary across studies. This method also avoids complications arising from the use of different statistical models or transformation of effect sizes. Additionally, the methodological quality of each study was assessed using the Newcastle–Ottawa Scale (NOS) [27], and the NOS scores were recorded.
Statistical analysis
The data collected in this meta-analysis were the total number of participants and the number of participants who experienced menarche at the age < 13 in both case and control groups. The overall data were first analyzed as the main global result, and they were then further analyzed in four subgroups: (1) menarche age at < 12 and ≥ 13 years old, (2) study location or geographic region for menarche age at < 13, and (3) source of control, whether population- or hospital-based, for menarche age < 13.
For regional subgroup analyses, studies were categorized by geographic region based on the World Health Organization (WHO) and United Nations (UN) regional classification systems. Countries were grouped as (1) Europe: Italy, France, Germany, Norway, Sweden, Poland, United Kingdom, Spain, Finland; (2) North America: United States, Canada, Mexico; (3) South America: Brazil, Uruguay, Argentina; (4) West Asia: Iran, Saudi Arabia; (5) East Asia: China, Japan; (6) South Asia: India, Pakistan; (7) Southeast Asia: Malaysia, Thailand, Indonesia; (8) Oceania: Australia; (9) Africa: Central African Republic, Morocco, Nigeria. Geographic region was defined by the country where each study was conducted. In multi-country studies, the region of the primary recruitment site or majority population was used.
Dichotomous data were analyzed using the Mantel–Haenszel method and the inverse variance method, yielding odds ratio (OR) with respective 95% CIs and weight for each estimate, and the results were presented in forest plots. The pooled OR was symbolized by a solid diamond at the bottom of each forest plot. Statistical heterogeneity of the studies was assessed using the restricted maximum likelihood (REML) method and Q‐profile method, and the result was expressed as τ^2^ and I^2^ with 95% confidence intervals (CIs), as well as p-value. A common-effect (fixed-effect) model was used when heterogeneity was low (I^2^ ≤ 30%). When heterogeneity was greater than this threshold (I^2^ > 30%), a random-effects model was applied. When I^2^ > 50% or p < 0.10, heterogeneity was considered significant for pooled odds ratios. Egger’s test was applied to assess publication bias via a funnel plot asymmetry [28]. The result of p < 0.05 was considered as an indicator for the potential presence of publication bias. All meta-analyses were performed using R version 4.4.2 [29]. R statistics packages used for data analysis were ‘meta’ version 6.5.0 [30] and ‘metafor’ version 4.4.0 [31].
Results
Characteristics of included studies
Figure 1 illustrates the study selection flowchart for this meta-analysis. Initially, 600 studies were retrieved from the literature search. After identifying and removing 10 duplicate studies, 209 articles were excluded due to being book chapters, review papers, in a foreign language (Chinese), or irrelevant. This left 381 full-text articles for eligibility screening. Among these, 175 were excluded for not being observational studies, and 132 were removed due to incomplete data. Ultimately, 74 articles were included in the analysis, comprising 66,650 cases and 145,172 controls. These 74 studies [32–105] were published between 2014 and 2023. The main characteristics of the selected studies are presented in Supplementary Material 1. Across the 74 case–control studies analyzed, the age at menarche ranged from 12 to 15 + years, with a weighted average of 12.78 years among cases and 12.77 years among controls. Early menarche (before age 13) was observed in 32.27% of cases and 29.81% of controls. The majority of cases (58.16%) and controls (61.00%) experienced menarche between 13 and 14 years, while fewer cases (9.57%) and controls (9.19%) reported menarche at age 15 and above.Fig. 1. The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) flow chart of 74 included studies in the meta-analysis
Meta-analyses
The results of each meta-analysis were presented in Figs. 2, 3, 4, 5, 6. The pooled OR for overall data from 74 studies was 1.15 (95% CI: 1.08–1.24) as portrayed in Fig. 2, indicating an association between menarche before age 13 and an increased risk of female cancer across the studies. Further analysis of the subgroup with menarche before age 12 yielded a pooled OR of 1.27 (95% CI: 1.09–1.48). For the subgroup with menarche at age 13 or older (Fig. 3), the pooled OR was 0.89 (95% CI: 0.85 – 0.94), suggesting a lower risk in this group.Fig. 2. The forest plot depicting the pooled odds ratio (OR) from 74 studies worldwide, illustrating the association between menarche at age < 13 and the risk of female breast cancer. In this plot, “Case” represents the group of breast cancer patients, “Control” indicates the group of individuals without breast cancer, and “Events” signifies the number of cases or controls who experienced menarche before the age of 13Fig. 3The forest plot displaying the pooled odds ratio (OR) for the association between menarche at age ≥ 13 and the risk of female breast cancer. In this plot, “Case” represents the group of breast cancer patients, “Control” refers to the group of individuals without breast cancer, and “Events” indicates the number of cases or controls who experienced menarche at age ≥ 13Fig. 4The forest plot illustrates the pooled odds ratio (OR) for the association between menarche before age 13 and the risk of female breast cancer across four global regions: West Asia, Europe, Oceania, and North America. In this plot, “Case” identifies individuals with breast cancer, “Control” represents those without breast cancer, and “Events” denotes the number of people in each group who had menarche before age 13Fig. 5Forest plots showing regional associations between menarche before age 13 and breast cancer risk in Africa, East Asia, South America, South Asia, and Southeast Asia. The wider confidence intervals observed in these analyses indicate reduced statistical precision due to smaller sample sizes and fewer studiesFig. 6Contour-enhanced funnel plot of included studies. The plot is centered on a log odds ratio of zero (null effect), with dots representing the dispersion of effect sizes on the x-axis against the standard error on the y-axis. Different shading colors indicate different p-value ranges: Dots in the unshaded (white) region correspond to p-values > 0.1, the light-gray shaded region indicates p-values between 0.1 and 0.05, the dark-gray region indicates p-values between 0.05 and 0.01, and the region outside the funnel corresponds to p-values below 0.01
In the subgroup analysis by region, the pooled ORs for Europe (OR = 1.15, 95% CI: 1.07–1.25), North America (OR = 1.07, 95% CI: 1.03–1.11), Oceania (OR = 1.15, 95% CI: 1.02–1.30), and West Asia (OR = 1.70, 95% CI: 1.15–2.52) were consistent with regional differences in the association between menarche age and breast cancer risk. These results are depicted in Fig. 4.
Associations that were not statistically significant were observed in Africa (OR = 0.84, 95% CI: 0.22 – 3.18), East Asia (OR = 1.13, 95% CI: 0.95 – 1.34), South America (OR = 0.91, 95% CI: 0.77 – 1.09), South Asia (OR = 1.29, 95% CI: 0.65 – 2.56), and Southeast Asia (OR = 0.85, 95% CI: 0.62 – 1.16). Although the overall direction of effect was generally consistent with the global trend, the magnitude of association was small and the precision of estimates varied across regions, reflecting the limited number of studies and wide confidence intervals in some subgroups. To improve interpretability, the results for South Asia and East Asia are presented separately, highlighting potential regional differences in sample size and population characteristics. The forest plots for these regions are shown in Fig. 5.
Subgroup analysis based on the source of subjects showed associations in both population-based and hospital-based study designs. In the population-based subgroup, the pooled OR was 1.11 (95% CI: 1.08 – 1.14), while the hospital-based subgroup had a pooled OR of 1.26 (95% CI: 1.06 – 1.51).
A summary of all analyses is available in Table 1, and the data extracted from the 74 studies can be found in Supplementary Material 2.Table 1. Pooled odds ratio for different populations and ages of menarche in the meta-analysesPopulationAge of menarche (years old)Number of studiesStatistical modelPooled OR (95% CI)p-value^a^p-value^b^(Publication bias)p-value^c^(Heterogeneity)I^2^% (95% CI)τ^2^ (95% CI)All < 13, < 12, < 1074Random-effects model1.153 (1.076–1.236)**** < 0.0010.171 < 0.00172.3(65.2–78.0)0.06(0.08–0.25)All < 12, < 1026Random-effects model**1.267 (1.087–1.475)**0.0020.051 < 0.00174.1(62.0–82.4)0.11(0.09–0.73)All ≥ 1369Random-effects model0.894 (0.849–0.943)**** < 0.0010.975 < 0.00166.3(56.7–73.8)0.03(0.04–0.13)Africa < 13, < 123Random-effects model0.836 (0.220–3.184)0.7930.873 < 0.00191.6(78.4–96.7)1.31(0.28–58.38)East Asia < 13, < 126Random-effects model1.127 (0.946–1.343)0.1810.2980.00768.4(25.4–86.6)0.03(0.00–0.19)Europe < 13, < 1219Random-effects model**1.154 (1.068–1.246)*** < 0.0010.5920.00255.2(25.0–73.3)0.01(0.00–0.08)North America < 13, < 1219Common-effect model1.069 (1.029–1.110)*** < 0.0010.1120.15025.4(0.0–57.3)0.00(0.00–0.06)Oceania < 132Common-effect model1.150 (1.015–1.304)**0.028N/A0.6230.0(0.0–0.0)0.00(0.00–0.00)South America < 13, < 124Common-effect model0.911 (0.765–1.085)0.2980.5970.19536.3(0.0–77.9)0.04(0.00–1.26)South Asia < 13, < 126Random-effects model1.290 (0.651–2.557)0.4650.526 < 0.00186.8(73.5–93.4)0.65(0.20–4.67)Southeast Asia < 13, < 124Random-effects model0.849 (0.622–1.159)0.3010.5730.05061.7(0.0–87.1)0.05(0.00–0.70)West Asia < 13, < 12, < 1011Random-effects model1.699 (1.145–2.520)0.008*0.463 < 0.00181.7(68.4–89.4)0.35(0.17–3.05)Population-based control < 13, < 1239Random-effects model1.107 (1.080–1.135)**** < 0.0010.8320.01535.9(5.1–56.6)0.00(0.00–0.06)Hospital-based control < 13, < 12, < 1035Random-effects model**1.263 (1.058–1.507)**0.0100.108 < 0.00183.3(77.6–87.5)0.23(0.17–0.70)OR odds ratio, CI confidence interval, N/A Not available^a^Random-effects model was used when the p-value for heterogeneity test was < 0.10 or I^2^ > 50%; otherwise, the common-effect model was used^b^Egger’s test to evaluate publication bias, p-value of < 0.05 is considered statistically significant^c^p-value of < 0.10 is considered statistically significant for Q statistics*Significant p-value (< 0.05) for pooled OR
Publication bias
The publication bias of the 74 included studies was evaluated with Egger’s test. The shape of the contour-enhanced funnel plot as depicted in Fig. 6 did not show any obvious asymmetry in the overall meta-analysis. Overall, there was no significant publication bias found among these 74 studies (p = 0.12). Moreover, there was no statistically significant publication bias found in any of the subgroup analyses. Summary of publication bias is available in Table 1.
Discussion
The meta-analysis of 74 studies worldwide revealed an association between early menarche (before age 13) and an increased risk of female breast cancer, with a pooled odds ratio (OR) of 1.15, reflecting a 15% higher likelihood of developing breast cancer compared to those with later menarche. Although statistically significant, this effect size was modest. An odds ratio of approximately 1.15 indicates a small increase in risk and should be interpreted cautiously. Early menarche is better understood as a weak but consistent risk factor that contributes alongside multiple reproductive, genetic, lifestyle, and environmental influences. On its own, the magnitude is unlikely to have substantial clinical impact at the individual level and may partly reflect residual confounding and variability across study designs. Notably, the data for menarche before age 12 showed an even higher pooled OR of 1.27, indicating an increased risk, suggesting that earlier menarche may have a stronger impact on breast cancer development. On the contrary, menarche at age 13 or older was associated with a lower pooled OR of 0.89, indicating a reduced likelihood of developing breast cancer compared to those with earlier menarche. These results suggest a modest increase in breast cancer risk among females who experienced menarche at a younger age, likely due to the combined influence of factors such as prolonged estrogen exposure [106] across diverse populations. During puberty, hormones and growth factors, notably estrogen and insulin-like growth factor-1 (IGF-1), play a crucial role in breast development [107]. This period is marked by rapid cell proliferation, leading to ductal branching and the creation of terminal end buds [107, 108]. These buds later develop into terminal ductal lobular units, which are the primary sites where the majority of breast cancers begin [108, 109]. Therefore, early menarche likely increases the risk of breast cancer primarily due to these developmental processes [110].
The regions included in this meta-analysis exhibited varying associations between menarche before age 13 and the risk of breast cancer. The West Asia subgroup showed the strongest association (OR = 1.70), suggesting a higher risk of breast cancer development in this region, followed by Europe and Oceania (both OR = 1.15), and North America (OR = 1.07). It is important to note that regional variability may reflect differences in statistical power and precision rather than true biological differences. The pooled estimates for Africa, East Asia, South America, South Asia, and Southeast Asia were not statistically significant. The smaller effect sizes and wider confidence intervals in these regions suggest reduced statistical power, likely due to the limited number of available studies and smaller sample sizes. The lack of statistical significance should therefore be interpreted as a reflection of imprecision rather than absence of effect. In addition, publication year does not necessarily correspond to the period in which participants were born or when breast cancer cases were diagnosed. Secular trends in childhood nutrition, height growth, obesity, and age at menarche differ across birth cohorts and regions, and many of the included studies likely drew on women diagnosed decades earlier. Therefore, part of the regional variability may reflect differences in underlying birth cohorts rather than true geographic differences.
Moreover, differences in data quality, diagnostic period, and population characteristics may further contribute to regional variability. To capture this heterogeneity more clearly, the results for South Asia and East Asia are discussed separately, acknowledging their distinct epidemiological and cultural contexts. For instance, normal breast epithelium is more likely to be estrogen receptor-positive (ER +) in White Australian women than in Japanese women [111], which may be linked to variations in the structure and gene expression of normal breast tissue. Asian women in Asia generally exhibit lower breast cancer rates despite having smaller, denser breasts compared to Caucasian women. Nevertheless, within Asian populations, cancer risk may correlate with breast density [112]. Regional variability may also be influenced by differences in menopausal status and tumor receptor subtypes (such as ER/PR positivity). Ideally, the association between early menarche and breast cancer would be examined separately for premenopausal and postmenopausal women, as well as across hormone receptor subtypes. However, these variables were not consistently reported across studies, particularly in low- and middle-income regions, preventing meaningful stratification by menopausal or tumor biology subgroups. These differences highlight the importance of considering both the size and precision of regional effect estimates when interpreting global trends in early menarche and breast cancer risk.
On the other hand, a cross-sectional study comparing breast cancer risks among Lebanese and Lebanese American women revealed that Lebanese women had higher risk factors for developing breast cancer [113]. This study showed that Lebanese women had higher rates of smoking (both cigarettes and hookah) and tended to live closer to highways or main roads, whereas Lebanese American women breastfed for longer periods, exercised more, and consumed significantly more vegetables and fruits, all of which are protective against breast cancer [113]. Furthermore, a case–control study by Ziegler et al*.* (1993) investigated breast cancer risk factors among Asian American women versus women in Asia. The study found that Asian American women had higher breast cancer rates, largely due to Western lifestyle adoption. Key factors include dietary changes, reproductive behaviors, and overall lifestyle differences, indicating that environmental and lifestyle factors significantly contribute to breast cancer risk beyond genetic predispositions [114].
Subgroup analyses using both population-based and hospital-based controls showed an association between early menarche (before age 13) and the risk of female breast cancer, suggesting consistency in the findings across different control source types. Ideal control groups in epidemiologic studies are scarce, necessitating both theoretical and empirical efforts to minimize biases. Studies across various diseases, including vulvar cancer [115], idiopathic pulmonary fibrosis [116], and stroke [117], had identified a selection bias when hospital-based data are used in epidemiological or case–control studies, highlighting the value of population-based data for providing more accurate insights into patient characteristics. Hospital controls, suited for hospital cases, may not adequately represent the general population, whereas population controls are preferred despite their expense and procurement challenges [118]. Nonetheless, it has been suggested that carefully selected hospital controls can yield risk estimates comparable to those from population controls for a range of diseases [119].
It is important to note that there has been a secular decline in the age of menarche in many populations over the past century [120]. This trend may influence the relationship between age at menarche and breast cancer risk, with earlier menarche potentially conferring a higher risk. The current analysis indicates that women in regions such as West Asia, Europe, and North America, where secular trends have been most pronounced, may exhibit a heightened risk of breast cancer due to earlier menarche. Future research should consider how these trends in menarche timing, in conjunction with other environmental and lifestyle factors, influence breast cancer risk across generations. Regional differences in the age at menarche could be influenced by various factors, including socioeconomic conditions, cultural practices, and dietary patterns [120]. For instance, women in West Asia, Europe, and North America have generally experienced earlier menarche compared to populations in other regions, potentially contributing to the higher breast cancer risk observed in these areas. These findings emphasize the importance of developing region-specific risk assessment models to account for geographic variability in risk factors and outcomes.
An important consideration in this meta-analysis is the exclusion of cohort studies, which could offer valuable insights into the association between early menarche and breast cancer risk. Cohort studies, particularly prospective ones, avoid the differential recall bias inherent in case–control studies, where individuals with breast cancer may recall the timing of menarche differently compared to controls. This makes cohort studies less susceptible to recall bias [121], and their use could strengthen the evidence on this association. However, the inclusion criteria of this meta-analysis focused on observational studies that reported raw case and control counts, as this allowed for a more standardized and consistent approach to pooling data across studies. Cohort studies typically report results in the form of hazard ratios (HRs) or relative risks (RRs), which differ from the raw case–control counts required for the present meta-analysis. This data format difference posed a challenge to integrating cohort studies into the current analysis.
A literature search had been conducted and identified two relevant cohort studies. However, these studies reported their findings using different metrics (one with HRs [23] and the other with ORs [122] derived from adjusted models), which prevented their inclusion in the pooled analysis due to the absence of raw case–control data. Converting these results into a common metric would introduce additional assumptions and methodological complexities, potentially compromising the reliability of the estimates. Despite this, a discussion of these cohort studies has been included in this manuscript, acknowledging their findings and highlighting the complementary evidence they provide in relation to the case–control studies included in the present analysis. Future research may benefit from incorporating both cohort and case–control data, potentially pooling the results from cohort studies separately to avoid such discrepancies.
In addition to the findings from the case–control studies included in this meta-analysis, two cohort studies provide valuable insights into the relationship between menarche age and breast cancer risk. Yang et al. (2022) found that compared to menarche at age 13, menarche at age ≤ 11 years was significantly associated with an increased risk of early-onset breast cancer at age 40 (OR = 2.62, 95% CI: 1.38–4.97, p = 0.003), while menarche at age ≥ 16 years was significantly associated with a decreased risk (OR = 0.13, 95% CI: 0.03–0.53, p = 0.005) [122]. This suggests that earlier menarche may be a stronger predictor of early-onset breast cancer. In a similar vein, Bodicoat et al. (2014) reported that women whose menarche occurred at age 15 or older had a lower risk of breast cancer (OR = 0.78, 95% CI: 0.62–0.99) compared to those with menarche at age 13–14 [23]. However, they did not find an increased risk of breast cancer for women with menarche at age 12 or younger. These findings align with the results of our meta-analysis, further supporting the association between later menarche and a reduced risk of breast cancer, and suggesting that menarche at a younger age may not always be associated with higher risk, depending on the context.
The present meta-analysis faces several limitations, which include heterogeneity in study designs and populations, varying definitions and measurements across studies, and potential confounding factors such as family history and lifestyle variables that may not be consistently controlled. A key limitation of this meta-analysis is the reliance on raw numbers to calculate odds ratios, as this approach does not account for potential confounding variables. While adjusted odds ratios can provide more precise estimates by controlling for confounders, their inconsistent availability and the variability in adjustment models across studies precluded their use in this analysis. Despite this limitation, the use of raw data ensures methodological consistency and minimizes the heterogeneity that could arise from combining studies with differing adjustment factors. Also, limiting inclusion to studies from the past 10 years improves relevance to current populations but may have excluded older research with valuable longitudinal insight. Nonetheless, secular trends in age at menarche and breast cancer risk factors justify prioritizing recent data.
Additionally, the inability to obtain and analyze the data for a full range of specific menarche ages due to the varying categorization of menarche ages in the selected articles poses a significant constraint. Menopausal status and ER/PR receptor subtype were inconsistently reported across the included case–control studies, and were often missing entirely in studies from low- and middle-income regions. Consequently, it was not possible to assess whether regional differences in the association between early menarche and breast cancer risk were influenced by menopausal distribution or hormone-receptor-positive disease, nor to compare associations separately for premenopausal and postmenopausal women. On top of that, some regional subgroup estimates were based on a small number of studies or participants, which reduced statistical power and resulted in wider confidence intervals, limiting precision in regional comparisons. As most studies did not report birth cohort or year of diagnosis, we could not adjust for temporal changes in height growth, weight gain, or secular declines in menarche age, which may partly explain regional variability. Also, retrospective case–control designs introduce risks of recall and selection biases, and variability in risk estimates between studies, as well as differences in data quality and reporting, further challenge the generalizability of findings.
Furthermore, differences in study design, quality, and sources of bias may contribute to the observed heterogeneity in results. Population-based studies generally provide more representative data, whereas hospital-based studies may introduce selection biases, potentially affecting the magnitude and precision of the observed associations. The quality of the included studies varied, with lower-quality studies possibly yielding less reliable estimates. Additionally, publication bias could have inflated the estimates, as studies with significant findings are more likely to be published. Regional variations may reflect differences in population characteristics, healthcare systems, and environmental exposures, all of which could influence both the risk of breast cancer and the age at menarche. These limitations emphasize the need for cautious interpretation and highlight the areas for future research refinement.
Taken together, the results of this meta-analysis suggest that early menarche is associated with a modestly higher risk of breast cancer. However, the magnitude of this effect is small and should be interpreted with caution. As all included studies were observational, these findings demonstrate association but not causation. Moreover, the heterogeneity among regions may reflect differences in study design, diagnostic periods, and reporting practices rather than true biological variation. The reliance on case–control data introduces potential recall and selection biases, and the exclusion of cohort studies due to limited source of studies and differing data formats may have limited temporal inference. Additionally, incomplete information on menopausal status and hormone receptor subtypes across studies may have influenced pooled estimates. Future research integrating cohort data and individual participant-level analyses would strengthen causal interpretation and regional comparability.
Conclusions
In conclusion, this meta-analysis of 74 case–control studies from 28 countries indicates that women who experienced menarche before age 13 have a modestly increased risk of breast cancer (OR = 1.15), while those with menarche at age 13 or older show a correspondingly lower risk. Regional differences were observed, with higher odds in West Asia, Europe, Oceania, and North America. Nevertheless, these patterns may partly reflect variations in study availability, population characteristics, and diagnostic timing. The findings provide updated evidence supporting early menarche as a long-recognized reproductive risk factor for breast cancer, yet the small effect size indicates that it is one of many contributors to overall risk. Given the observational nature of the included studies, these results should not be interpreted as causal. Several limitations, including the reliance on case–control data and incomplete reporting of menopausal and receptor status, necessitate cautious interpretation of the findings. Future studies integrating cohort data and region-specific analyses may help clarify the influence of menarche timing within evolving population contexts.
Supplementary Information
Below is the link to the electronic supplementary material.Supplementary file1 (DOCX 48 KB)Supplementary file2 (CSV 7 KB)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Wells G, Shea B, O’Connell D, et al The Newcastle-Ottawa Scale (NOS) for assessing the quality of nonrandomised studies in meta-analyses. In: https://www.ohri.ca/programs/clinical_epidemiology/oxford.asp.
