Age at menarche and idiopathic pulmonary fibrosis: a two-sample mendelian randomization study
Jiaqi Cao, Yazhou Ma, Wei Zhao, Chunlai Feng

TL;DR
Later age at menarche is linked to a higher risk of idiopathic pulmonary fibrosis, suggesting lower estrogen exposure may increase disease susceptibility.
Contribution
This study is the first to use Mendelian randomization to establish a causal link between age at menarche and idiopathic pulmonary fibrosis.
Findings
Genetically later age at menarche was associated with increased IPF risk (OR = 1.0014, p = 0.001).
Results were consistent across multiple MR methods and showed no heterogeneity or pleiotropy.
Findings suggest a protective role of endogenous estrogen against IPF.
Abstract
Sex difference in the incidence rate of idiopathic pulmonary fibrosis (IPF) indicates that estrogen has a certain protective effect on the disease. Nevertheless, there is a dearth of study investigating the association between factors pertaining to endogenous estrogen exposure level, such as age at menarche (AAM) in women, and IPF. Our study intended to employ Mendelian randomization (MR) method to elucidate the causal association between AAM and IPF. Our study utilized AAM as a measure of endogenous estrogen exposure and investigated its causal effect on the risk of IPF through MR. We employed the inverse variance weighted (IVW) method to assess the causal relationship between AAM and IPF risk, with supplementary analyses conducted using the weighted median estimator (WME) and MR-Egger method. Several sensitivity analyses were performed to assess the dependability of MR estimates. A…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterstitial Lung Diseases and Idiopathic Pulmonary Fibrosis · Eosinophilic Disorders and Syndromes · Asthma and respiratory diseases
Introduction
Idiopathic pulmonary fibrosis (IPF) is a type of interstitial lung disease characterized by chronic progressive pulmonary fibrosis accompanied by irreversible reduction of lung function [1]. The main pathological changes of IPF include epithelial cell damage, epithelial mesenchymal transition (EMT), deposition of extracellular matrix (ECM) proteins, and proliferation and differentiation of fibroblasts [2]. In recent years, the incidence of IPF has been increasing globally [3]. Once diagnosed with this disease, the survival period is short, the mortality rate is high, and the prognosis is poor [4]. Therefore, early identification and prevention for this disease are necessary. The incidence of IPF is higher in males compared to females, and this gender difference may be partly attributed to the differential exposure to sex hormones, particularly estrogen [5, 6].
The role of estrogen in IPF is still controversial. Some studies suggested that estrogen played a protective role in the development of IPF [7, 8]. However, an animal experiment found that, compared with males, female rats showed a higher mortality rate and more severe fibrosis [9]. Menarche is a milestone event in the sexual development of adolescent girls and is closely related to the level of endogenous estrogen exposure. Current studies on the relationship between age at menarche (AAM) and lung diseases mainly focus on asthma and lung cancer [10]. These studies showed that early menarche may lead to lower lung function in adult women and increased risk of asthma [11, 12], while later initiation of menstruation was associated with a reduced risk of developing lung cancer [13]. However, there are currently no clinical studies investigating the causal relationship between IPF and AAM. This may be due to the fact that surveys on respiratory diseases and AAM have primarily been conducted in younger cohorts, whereas IPF is predominately an elderly disease prone to recall bias during surveys of their AAM. Therefore, further research is needed to explore the relationship between AAM and IPF.
Mendelian randomization (MR) is a method that uses genetic variations as instrumental variables to investigate the potential causal relationship between exposure and outcome. MR is based on three assumptions. Firstly, genetic variations must be strongly associated with exposure. Secondly, genetic variations should not be correlated with any known confounding factors. Lastly, genetic variations should only affect the outcome through exposure [14]. Since alleles are randomly assigned at conception, they are not associated with confounding factors, and diseases cannot affect genetic variations, consequently, MR can also avoid reverse causal problems. We used AAM as a substitute index for endogenous estrogen exposure [15], selected single nucleotide polymorphisms (SNPs) that exhibited a strong correlation with AAM as instrumental variables, with IPF as the outcome, the method of two-sample MR was employed to analyze the causal association.
Methods and materials
Data source
The menarche age data were obtained from the MRbase database.
(https://gwas.mrcieu.ac.uk/), the genome-wide association study (GWAS) were derived from the Within Family Consortium, which included a total of 29,346 participants [16]. AAM is treated as a continuous measure in years, which is usually asked directly of study participants. As for the IPF data, we utilized a recent GWAS data as an outcome dataset, including 1369 patients with IPF and 435,866 controls [17]. The entire population is European. All the GWAS datasets can be obtained from IEU open GWAS Project (https://gwas.mrcieu.ac.uk/, Age at menarche GWAS ID: ieu - b − 4822. IPF GWAS ID: ebi-a-GCST90018120).
Instrumental variable filtering
In the exposure database, we initially conducted a screening of SNPs using a genome-wide significance threshold of p < 5*10^-8. Subsequently, we eliminated SNPs that exhibited linkage disequilibrium (LD) with a threshold of R^2 < 0.001. The strength of the instrumental variables was assessed using the F-statistic. An instrumental variable is considered strong if the F-statistic > 10 [18]. All of the SNPs we utilized in our analysis had F statistics ranging from 55 to 78, well above the threshold of F-statistic > 10. This suggests that the existence of weaker instrumental variables is unlikely. For SNPs not found in the outcome summary data, we employed the online website tool LD-Link (http://ldlink.nih.gov) to identify proxy SNPs with a high degree of LD (r^2 > 0.8) [19]. Ultimately, a total of 9 SNPs were utilized for the MR analysis (Table 1).
Table 1. Characteristics of SNPs used as instrumental variablesSNPEAOAEAFBetaSEp-valueFrs10978435CT0.315574-0.08980.01444.13E-1069rs16924631CG0.13839-0.09040.01643.67E-0861rs2090409AC0.314781-0.09670.0131.05E-1377rs2362643AG0.668282-0.08740.01347.39E-1175rs314268AG0.661809-0.14010.01285.09E-2878rs543874GA0.206329-0.07920.01443.80E-0869rs7114175TA0.4996880.07460.01286.26E-0978rs72887143TA0.166020.08790.01593.55E-0863rs79627842CT0.127606-0.13010.01828.23E-1355EA: effect allele; OA: other allele; EAF: effect allele frequency; SE, standard error; F, F-statistic
Statistical analysis
We employed the inverse variance-weighted (IVW) method as our primary analysis approach, which is known to yield accurate results under the assumption that all instrumental variables meet the necessary assumptions [20]. Additionally, we utilized the weighted median estimator (WME) [21] and the MR-Egger method [22]. The WME method can provide relatively stable results when at least 50% of the instrumental variables satisfy the assumptions. Similarly, in the absence of heterogeneity, the MR-Egger method can yield relatively robust results. However, it is worth noting that the WME and MR-Egger methods exhibit slightly lower statistical efficiency, and thus, we employed them as supplementary analyses. To assess the heterogeneity among instrumental variables, we employed the Cochrane Q test [23]. Furthermore, we employed the MR-Egger intercept test [22] to evaluate whether the instrumental variables exhibit directional pleiotropy. A significant deviation of the intercept from zero suggests the presence of directional pleiotropy among the instrumental variables. Moreover, we employed the Mendelian Randomization Pleiotropy RESidual Sum and Outlier(MRPRESSO) method [24] to identify the outliers that may introduce horizontal pleiotropy. If outliers were detected, the corresponding SNPs were removed from the analysis. Additionally, we employed the leave-one-out method, which evaluates the remaining SNPs’ estimates of the results by deleting each SNP one by one, to evaluate the impact of individual SNPs on the results. All statistical analyses were conducted using the TwosampleMR package [25] and MRPRESSO package [24] in the R (version 4.1.2).
Result
The results obtained through the IVW method indicate a positive correlation between the AAM and the increased risk of IPF (odds ratio [OR] = 1.0014, 95% confidence interval [CI] = 1.0005∼1.0023, p = 0.001). Similar estimates were obtained through the WME method (OR = 1.001, 95% CI = 1.000∼1.002, p = 0.093) and the MR-Egger method (OR = 1.000, 95% CI = 0.996∼1.004, p = 0.967) (Fig. 1). However, these supplementary methods have lower testing efficiency, particularly the MR-Egger method, resulting in larger confidence intervals.
Fig. 1. Scatter plot for the causal effect of AAM on IPF risk
The Cochran’s Q test results indicated that there was no heterogeneity observed among the SNPs (Q = 98.46, p = 0.15). Additionally, the MR-Egger intercept test revealed an intercept value of 0.012 and a p-value of 0.39, suggesting the absence of directional pleiotropy in the results. Furthermore, there was no outlier displayed by MRPRESSO method, which indicated that the results were not influenced by abnormal SNPs and exhibited pleiotropy. The leave-one-out method analysis demonstrated that none of the SNPs had a significant impact on the results (Fig. 2). Consequently, our findings can be considered robust.
Fig. 2. Leave-one-out analysis of the effect of Age at menarche on IPF
Discussion
Our study found that a later onset of menarche is associated with an increased risk of IPF. We used AAM as a surrogate of the level of exposure to endogenous estrogen. A later AAM means a lower cumulative level of exposure to endogenous estrogen, further supporting the hypothesis that endogenous estrogen plays a protective role in IPF.
IPF exhibits a higher prevalence in males, with males experiencing an earlier age of onset and a poorer overall survival rate compared to females. Consequently, female gender is considered as a protective factor against the development of this disease [26]. Males have less accumulation of endogenous estrogen throughout their lifetime in comparison to females. In the female population, a later AAM corresponds to reduced exposure to endogenous estrogen. Our result suggested that a delayed AAM was associated with an increased risk of developing IPF, thereby supporting the protective role of female gender and estrogen in the incidence of IPF. However, there is a lack of clinical studies that have examined the impact of gender-related factors, such as AAM and estrogen levels, on the incidence and prognosis of IPF. Therefore, the result of our study has important guiding significance for future clinical investigations in this area.
The effect of AAM on the risk of IPF can be mediated by estrogen. Research has shown that estrogen can exert anti-inflammatory effects by decreasing tumor necrosis factor-α production, interferon-γ expression and natural killer cell activity [27]. Inflammatory response is also an important mechanism of IPF [28], therefore estrogen may play a protective role in IPF through anti-inflammatory effects. EMT is an important pathological process in IPF [29]. Transforming growth factor-β (TGF-β)/Smad cascade is a key fibrotic axis that drives EMT by increasing the ECM synthesis and collagen deposition. Andugulapati SB et al. found that, phytoestrogen Biochanin-A mitigated the development and progression of pulmonary fibrosis by modulating the TGF-β/Smad3 pathway and ameliorating the fibrotic cascade of events [30]. Furthermore, estradiol could specifically down-regulate the expression of chloride intracellular channel protein 3 and retinol binding protein 7 genes related to the pathogenesis of pulmonary fibrosis in human bronchial epithelial cells, and affect the pathways associated with pulmonary fibrosis [7]. Additionally, 2-methoxy estradiol, a metabolite of estradiol in the human body, has been found to inhibit the growth of human pulmonary artery smooth muscle cells and human lung fibroblasts in a concentration-dependent manner. It also exhibited strong anti-inflammatory, anti-fibrosis, and anti-vascular remodeling effects in models of pulmonary fibrosis and hypertension [31].
Several studies conducted on mice with IPF also support that estrogen may have a protective role in IPF. Female mice displayed relatively mild pulmonary fibrosis changes and longer survival compared to male mice [32, 33]. Oophorectomy in female mice gave rise to significant thickening of airway smooth muscle and promoted the progression of airway fibrosis, which could be reversed by estrogen replacement therapy [34]. However, in a study targeting rats, the opposite phenomenon was observed: female rats experienced more severe bleomycin induced pulmonary fibrosis than age matched male rats, while female rats experienced reduced pulmonary fibrosis after oophorectomy, and estrogen replacement therapy exacerbated pulmonary fibrosis [9]. We speculate that this phenomenon may be related to the complex mechanism of estrogen’s action in the lungs [10]. In addition, SolopovP et al. discovered that the supplementation of phytoestrogens acting by stimulating estrogen receptors could improve lung function impairment in murine models of hydrochloric acid-induced pulmonary fibrosis [8]. The findings of these studies are mutually confirmed with the result of our study to some extent. Early identification and intervention of IPF patients with late menarche are needed to improve the prognosis of the disease and prolong survival. In the future, it is still necessary to further explore the specific mechanism of estrogen’s role in IPF, so as to develop new treatments.
This study represents the first application of MR to investigate the potential causal relationship between AAM and IPF. MR studies offer advantages over traditional observational studies by reducing the risk of confounding. In this study, we utilized recently published large-scale GWAS data, which provide increased statistical power due to larger sample sizes. Furthermore, we conducted multiple sensitivity analyses to assess the robustness of our findings. Consequently, the results of our study hold significant reference value for informing future clinical research. However, there are several limitations to consider. Firstly, we used GWAS data on menarche from the Within Family Consortium, which includes both men and women, the lack of sex-specific data from the public database weakens the causal relationship between AAM and IPF in our study, making our results more conservative. Secondly, all participants in our study were of European descent, thus limiting the generalizability of our findings to other ethnicities. Further cross-ethnic studies, if corresponding GWAS data are available, are necessary to address this limitation. Thirdly, we did not use age of menopause as an exposure because after screening the GWAS instrumental variables for menopausal age, only three SNPs were obtained as instrumental variables. We believe that the instrumental variables are too few to serve as a genetic proxy for age of menopause. In the future, when there are larger scale GWAS studies on age of menopause and more instrumental variables for age of menopause can be obtained, MR analysis can be conducted on the relationship between age of menopause and IPF. Lastly, it is important to note that MR assumes a linear correlation between AAM and IPF, although the actual relationship may not necessarily follow this pattern.
Conclusion
Our research has presented the finding that suggests a correlation between a delayed AAM and an elevated susceptibility to developing IPF, possibly attributed to changes in endogenous estrogen levels. Further studies are necessary to validate this hypothesis and elucidate the precise molecular mechanisms underlying this relationship.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Barratt SL Creamer A Hayton C Idiopathic pulmonary fibrosis (ipf): an overview J CLIN MED 20187820110.3390/jcm 708020130082599 PMC 6111543 · doi ↗ · pubmed ↗
- 2Confalonieri P Volpe MC Jacob J Regeneration or repair? The role of alveolar epithelial cells in the pathogenesis of idiopathic pulmonary fibrosis (ipf)Cells (Basel Switzerland)20221113209510.3390/cells 11132095 PMC 926627135805179 · doi ↗ · pubmed ↗
- 3Hutchinson J Fogarty A Hubbard R Global incidence and mortality of idiopathic pulmonary fibrosis: a systematic review EUR RESPIR J 201546379580610.1183/09031936.0018511425976683 · doi ↗ · pubmed ↗
- 4Moss BJ, Ryter SW, Rosas IO. Pathogenic Mechanisms Underlying Idiopathic Pulmonary Fibrosis. ANNU REV PATHOL-MECH. 2022; 17:515–46. 10.1146/annurev-pathol-042320-030240.10.1146/annurev-pathol-042320-03024034813355 · doi ↗ · pubmed ↗
- 5Raghu G Weycker D Edelsberg J Incidence and prevalence of idiopathic pulmonary fibrosis AM J RESP CRIT CARE 20061747810610.1164/rccm.200602-163OC 16809633 · doi ↗ · pubmed ↗
- 6Sathish V, Martin YN, Prakash YS. Sex Steroid Signaling: Implications for Lung Diseases., Pharmacology. & therapeutics (Oxford). 2015; 150:94–108. 10.1016/j.pharmthera.2015.01.007.10.1016/j.pharmthera.2015.01.007PMC 452338325595323 · doi ↗ · pubmed ↗
- 7Smith LC Moreno S Robertson L Transforming growth factor Beta 1 targets estrogen receptor signaling in bronchial epithelial cells RESP RES 201819116010.1186/s 12931-018-0861-5PMC 611792930165855 · doi ↗ · pubmed ↗
- 8Solopov P Colunga Biancatelli RML Dimitropoulou C Dietary Phytoestrogens Ameliorate Hydrochloric Acid-Induced Chronic Lung Injury and Pulmonary Fibrosis in mice NUTRIENTS 20211310359910.3390/nu 1310359934684599 PMC 8536981 · doi ↗ · pubmed ↗
