An Overview of the Role of Genetic factors in Idiopathic Pulmonary Fibrosis: Insights from Epidemiology to Prognosis
Jiahao Liu, Zihan Yi, Ting Chen, Yinghua Ying, Yue Hu

TL;DR
This paper reviews how genetic factors influence the development and progression of idiopathic pulmonary fibrosis, a lung disease with poor prognosis.
Contribution
The paper provides a comprehensive overview of genetic variations and their roles in idiopathic pulmonary fibrosis, emphasizing personalized therapy.
Findings
Genetic variations in MUC5B, telomerase complex, and surfactant proteins impact IPF development and progression.
Genetic susceptibility influences both familial and sporadic forms of IPF.
Integrating genetic knowledge can improve diagnosis, treatment, and prognosis in IPF.
Abstract
Idiopathic pulmonary fibrosis (IPF), a chronic progressive fibrosing interstitial lung disease with an unclear etiology, is characterized by progressive respiratory impairment and a median survival of 3-5 years. The pathophysiology associated with genetic factors in IPF remains largely unknown, despite the fact that both familial and sporadic IPF exhibit genetic susceptibility. In this review, we comprehensively examine genetic variations associated with the functional roles of mucin 5B (MUC5B), telomerase complex, surfactant proteins, cytokines, signaling pathways, and epigenetic mechanisms. A multifaceted perspective derived from genetic, epidemiological, and clinical studies demonstrates that genetic variations exert differential impacts on the development, progression, and prognosis of IPF. We advocate for the application of genetic knowledge to facilitate the refinement of…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterstitial Lung Diseases and Idiopathic Pulmonary Fibrosis
1. Introduction
Idiopathic pulmonary fibrosis (IPF) is a chronic interstitial lung disease (ILD) that primarily affects adults and is characterized by fibrosis and clinical symptoms such as dyspnea and progressive deterioration of pulmonary function. IPF typically has an unfavorable prognosis with a median survival of approximately three to five years 10. The disease manifests in both sporadic and familial forms, each exhibiting genetic susceptibility. Familial pulmonary fibrosis (FPF) is diagnosed when at least two first- or second-degree blood relatives are affected by ILD 11. FPF accounts for 5-20% of all IPF cases 12, 13. Approximately one-third of individuals with sporadic IPF have a family history of pulmonary fibrosis 14, with nearly a quarter of the genetic risk attributable to rare variants of known FPF-associated genes 13.
Genetic factors play a role in the development and progression of IPF (Figure 1). While the pathogenesis of IPF remains unclear, it is influenced by a complex interplay of environmental and host factors. Current genetic research has identified key contributors such as mucin 5B (MUC5B) 15, telomerase 5, surfactant proteins 17, cytokines, and related signaling pathways. Moreover, the role of epigenetic signaling pathways in regulating the development and progression of IPF has garnered increased attention. Advances in genetics have further deepened understanding of the pathophysiology of IPF and supported the development of personalized medical strategies. Treatment options for IPF are limited and currently consist of two anti-fibrotic drugs, pirfenidone and nintedanib, which slow disease progression but fail to reverse established fibrosis 10, 18, 19. Lung transplantation may be considered for patients with end-stage disease. Notably, genetic variations, including MUC5B promoter polymorphisms and the Desmoplakin (DSP) rs2076295 genotype, have been linked to differential therapeutic responses to pirfenidone and nintedanib 20, 21. The genetic perspective improves both the effectiveness of treatments and the quality of life of patients through targeted therapeutic interventions.
In summary, advancements in genetics have provided new perspectives for a deeper understanding of IPF. In this review, we examine the complex role that genetic factors have in the pathogenesis, progression, and prognosis of the disease. While this review emphasizes genetic and epigenetic drivers of IPF, we contextualize these findings with select transcriptomic and proteomic studies that elucidate functional consequences of genetic perturbations. In addition, we present the latest concepts that are useful for researchers to discover new diagnostic and therapeutic pathways.
2. Epidemiology
The global incidence of IPF demonstrates marked geographic heterogeneity, with multinational registry studies highlighting distinct epidemiological patterns 25. Notably, adjusted incidence estimates range from 3.5 to 13 per 100,000 individuals in the Asia-Pacific region compared to 0.9-4.9 in Europe and 7.5-9.3 in North America 26. While occupational exposures and environmental factors have been associated with disease risk 27, the strong familial clustering observed in 5-20% of cases provides compelling evidence for genetic predisposition 11.
Emerging genetic epidemiology reveals population-specific risk architectures. For instance, the frequency of the MUC5B rs35705950 minor allele differs significantly across populations: 0.007 in East Asian, 0.02 in African/African American, and 0.11 in European populations. While this variant confers stronger association in European populations 28, its attenuated association in Asian cohorts like the South Korean IPF registry 29 suggests modifier loci or environmental interactions may shape ethnic-specific risk profiles. These genetic differences parallel clinical disparities. Multicenter studies have shown that African American patients with pulmonary fibrosis are diagnosed, hospitalized, and die at earlier ages than patients of European or Latino ancestry 30. These differences in clinical outcomes among different ethnic groups require a thorough examination of underlying causes, particularly focusing on the impact of genetic variations on the development and progression of IPF.
3. Pathogenesis
3.1. Mucin and cell adhesion
The MUC5B gene, encoding a mucin critical for airway defense, is expressed in the mucous glands of the terminal and conducting airways, alveolar type II epithelial cells (AECII), and the epithelial cells of alveolar cysts 31. The T/G variant rs35705950 in its promoter is the strongest genetic risk factor for IPF (Table 1) 32, with epigenetic studies revealing this risk allele is associated with MUC5B promoter hypomethylation and transcriptional activation 2. Epigenome-wide studies reinforce DNA methylation's role, such as MUC5B promoter hypomethylation, and highlight Solute Carrier Family 6 Member 6 (SLC6A6) rs112271207, a taurine transporter with putative epigenetic effects, as a candidate gene 2. While its minor allele frequency varies, functional studies reveal conserved pathogenic mechanisms: excessive production of MUC5B may impair mucosal defense mechanisms and reduce the effectiveness of mucociliary clearance in removing inhaled particles and microorganisms (Figure 2). Elevated levels of MUC5B mucin can disrupt normal surfactant function in the alveoli and distal airways, potentially resulting in alveolar collapse and inflammatory reactions 33. Although these mechanisms are important in the pathogenesis of IPF, they are not exclusive determinants, and the precise role of MUC5B in IPF remains unclear. Recent multi-ancestry meta-analyses have also implicated Mucin 1 (MUC1), encoding another transmembrane mucin, as a novel susceptibility locus, highlighting mucin dysregulation as a broader mechanism in IPF pathogenesis 34. While Nitrogen Permease Regulator 3-like Protein (NPRL3) rs74614704, a regulator of mTORC1 signaling, further links mucin pathways to fibrotic remodeling 3, 35.
DSP, an integral component of desmosomal structures, is crucial for maintaining intercellular connections and tissue integrity. Mechanistically, DSP variants, including rs2076295, disrupt alveolar epithelial integrity (Figure 2) 4. Such dysfunction can trigger abnormal extracellular matrix deposition and accelerate the progression of pulmonary fibrosis 4. Notably, the genetic variations in MUC5B and DSP genes exhibit significant correlations with DNA methylation 36, suggesting that multiple genetic factors may interact synergistically to influence the development and progression of IPF. These findings position mucin and adhesion pathways as interconnected drivers of IPF pathogenesis.
3.2. Telomere shortening and dysfunction
Telomeres, protective nucleoprotein complexes at chromosome termini, progressively shorten with cell division. Critical telomere attrition triggers DNA damage response (DDR) pathways. In IPF, sustained DDR activation may compromise the function of AECs, inducing alveolar epithelial cell senescence 37. Mutations in telomerase reverse transcriptase (TERT), telomerase RNA component (TERC) 6, poly(A)-specific ribonuclease (PARN), and regulator of telomere length 1 (RTEL1) affect telomere length and stability and are associated with the development of pulmonary fibrosis (Table 1) 38. Notably, TERT expression in lung fibroblasts correlates with acetylated histone H3K9 binding at its promoter 39, suggesting epigenetic regulation of telomerase activity.
In patients with IPF, a novel mutation in the NHP2 ribonucleoprotein (NHP2) gene (p.Y24N) disrupts the nuclear import of the NHP2 protein, thereby reducing the levels of proteins critical for telomere maintenance and telomerase activity 40. Mechanistically, telomere dysfunction impairs AECII progenitor function: dysfunctional telomeres in AECII (Figure 2) disrupt differentiation and regenerative capacity, promoting alveolar collapse and fibrosis 41. Genome-wide association studies (GWAS) now extend telomere-related genetics beyond core telomerase components. For example, the Spindle Apparatus Coiled-coil Protein 1 (SPDL1) rs116483731, encoding a mitotic spindle assembly protein, has been identified as a novel risk factor that accelerates telomere attrition and cellular senescence, likely via mitotic errors that exacerbate replicative stress 8. Similarly, Kinetochore Scaffold 1 (KNL1) rs12912339 and Stathmin 3 (STMN3) rs76537958 variants, implicated in spindle assembly, suggest that mitotic errors contribute to alveolar stem cell dysfunction 3, 32.
While these mutations are associated with IPF, they are not detected in the majority of patients 42. Strikingly, in IPF patients homozygous for the non-risk MUC5B rs35705950 allele, rare functional variants in TERT, PARN, TERC, or RTEL1 are enriched 7, suggesting a potential genetic interaction between MUC5B and telomerase-related genes. It highlights the need for stratified genetic testing: MUC5B-centric screening may overlook telomerase-driven subtypes requiring distinct management. These findings underscore the interplay between genetic, epigenetic, and mitotic stress mechanisms in IPF pathogenesis.
3.3. Surfactants
Pulmonary surfactants, lipid-protein complexes synthesized by AECII, maintain alveolar integrity by reducing surface tension. Surfactant dysfunction directly contributes to IPF pathogenesis through AECII injury and aberrant repair 43 (Figure 2), leading to decreased alveolar stability, alveolar collapse, and the development of fibrosis. Beyond structural roles, surfactant proteins modulate host defense by regulating pro-inflammatory cytokines, chemotaxis factors, and tissue repair processes 43.
Four surfactant-associated genes—surfactant protein A1 (SFTPA1), surfactant protein A2 (SFTPA2), surfactant protein C (SFTPC) and the ATP-binding cassette-type family A member 3 transporter (ABCA3)—are implicated in familial and sporadic pulmonary fibrosis (Table 1) 17, 44. Distinct functional roles of SFTPA1 and SFTPA2 may arise from altered expression ratios that modulate surfactant activity 9. Autosomal dominant SFTPC mutations, like SFTPC-I73T, disrupt lamellar body maturation in AECII, causing surfactant accumulation and oxidative stress 16. Recessive ABCA3 variants, localized to lamellar body membranes, are frequently linked to pediatric interstitial lung disease, underscoring surfactant dysregulation as a pan-age mechanism 17.
3.4. Cytokine and signaling pathway dysregulation
The transforming growth factor-β (TGF-β) signaling pathway constitutes a central axis in IPF pathogenesis, with both genetic predisposition and downstream effector mechanisms contributing to fibrotic progression. TGF-β1 induces fibroblast-to-myofibroblast differentiation through canonical Smad2/3 activation 45, 46, with epigenetic mechanisms critically modulating this process. Specifically, TGF-β1 promotes methylation of the Thy-1 promoter and recruits methyl-CpG binding domain protein 2 (MBD2) to activate the TGF-β-Smad pathway, creating a feed-forward loop that sustains fibroblast activation 47. Furthermore, TGF-β1 modulates H3K9me2/3 and H3K4me1/2/3 histone marks at Collagen Type I Alpha 1 (Col1A1), Connective Tissue Growth Factor (CTGF), and Plasminogen Activator Inhibitor 1** (**PAI-1) promoters, enhancing the transcriptional activity of fibrosis-associated genes 48. Nuclear accumulation of Smad proteins enables their function as transcriptional regulators of fibrosis-associated genes. Smad complexes also interact with histone deacetylases to remodel chromatin, facilitating extracellular matrix (ECM) protein expression 49. Experimental evidence indicates that Bone Morphogenetic Protein 4 (BMP4) antagonizes TGF-β1-driven effects by activating Smad1/5/9, thereby suppressing Smad2/3 phosphorylation and inhibiting myofibroblast differentiation and ECM synthesis 46. In addition, miR-17-92 and miR-29 inhibit TGF-β-driven fibrosis: miR-17-92 maintains alveolar homeostasis by blocking fibroblast activation 50, while miR-29 suppresses ECM synthesis via Yes-associated protein (YAP) signaling 51.
Canonical Wnt/β-catenin signaling exhibits spatial specificity in IPF, with alveolar epithelial-specific activation driving IL-1β-mediated TGF-β amplification 52. In addition, the non-canonical Wnt signaling pathway, exemplified by WNT5A, initiates cytoskeletal reorganization via the JNK and ROCK signaling pathways. F-actin-generated biomechanical tension facilitates proteolytic activation of latent TGF-β via integrin αv 53. Furthermore, genetic variants in A-kinase anchoring protein 13 (AKAP13), a kind of RhoA regulator, exacerbate TGF-β dysregulation and increased IPF susceptibility, underscoring the role of genetic predisposition in fibrotic remodeling 54. Notably, miR-26a downregulation in fibrotic environments exacerbates TGF-β1-induced ECM deposition, whereas its overexpression attenuates fibrosis 55, suggesting miRNA-based modulation of Wnt-TGF-β crosstalk. Emerging evidence positions mTOR signaling as both a TGF-β effector and independent genetic risk modulator. TGF-β-induced mTOR signaling depends on the canonical Smad signaling pathway and is independent of Phosphoinositide 3-kinase (PI3K)/AKT activity 56. Notably, genetic polymorphisms in DEP domain-containing mTOR-interacting protein (DEPTOR), an endogenous mTOR inhibitor, and other mTOR pathway components, like Regulatory Associated Protein of mTOR Complex 1 (RPTOR), have been linked to altered IPF risk, emphasizing the genetic modulation of mTOR-driven fibrosis 57. This multilayered interplay underscores the need for integrated therapeutic approaches to address signaling dysregulation in IPF.
The pathogenesis of IPF involves genetic susceptibility, alveolar epithelial dysfunction, and dysregulated signaling pathways. Mucin/adhesion abnormalities, telomere attrition, and surfactant dysregulation drive injury and senescence. Concurrently, cytokine/signaling imbalances and epigenetic modifications establish fibrotic cascades through genetic-epigenetic crosstalk. These mechanisms underscore the need for stratified therapies targeting molecular subtypes.
4. Clinical manifestations and diagnostic assessment
4.1. Clinical manifestations
IPF is a chronic and progressive lung disease characterized by a range of clinical symptoms. Initially, patients often experience exertional dyspnea, which may progress to persistent dry cough, weight loss, and other systemic symptoms as the disease advances. On physical examination, findings may include clubbing of the fingers and basal inspiratory crackles or rales, indicating impaired gas exchange and progressive pulmonary fibrosis. High-resolution computed tomography (HRCT) images typically reveal characteristic IPF findings, such as reticular opacities and honeycomb changes. Pulmonary function tests often demonstrate restrictive ventilatory defects and reduced carbon monoxide diffusion capacity. These tests are crucial for diagnosing IPF and monitoring disease progression.
With the advancement of precision medicine, genetic testing of patients with IPF is increasing. Genetic backgrounds can significantly influence the range and severity of clinical manifestations. For example, the* MUC5B* gene encodes key mucins present in the honeycomb cysts of patients with IPF 31. Carriers of the MUC5B promoter variant rs35705950 often exhibit more severe coughing symptoms and higher quantitative CT fibrosis scores, which may help quantify disease risk for relatives 58. These variants also correlate with distinct prognoses 59. In addition, further advances in imaging technology may lead to more accurate prognoses for patients. Observational studies show that patients with SFTPC or other surfactant-related gene mutations present with atypical radiological patterns, including cystic changes characteristic of interstitial pneumonia 60. These patients are often younger (average age 45 years) and exhibit lower forced vital capacity (FVC) and lung diffusion capacity for carbon monoxide (DLCO) compared to those with familial or sporadic IPF 61. Carriers of TERT mutations, even asymptomatic individuals, display significantly reduced DLCO and impaired DLCO response during exercise. HRCT scans in these patients reveal signs of pulmonary fibrosis and increased lung tissue volume fractions. Compared to non-carriers, TERT mutation carriers with IPF show a more pronounced reduction in lung diffusion capacity 62.
4.2. Diagnostic assessment
The 2018 guidelines 63 jointly published by the American Thoracic Society (ATS), the European Respiratory Society (ERS), the Japanese Respiratory Society (JRS), and the Asociación Latinoamericana de Torax (ALAT), recommend an initial assessment for suspected cases of IPF, with a focus on identifying possible known causes of ILD. After identifying a potential cause of ILD, a comprehensive evaluation is necessary to identify or rule out conditions such as hypersensitivity pneumonitis, connective tissue disease, pneumoconiosis, and iatrogenic problems. If diagnostic uncertainty persists after standard assessments, a multidisciplinary discussion (MDD) should be convened. This MDD should integrate clinical and HRCT findings to confirm or rule out the diagnosis. A definitive IPF diagnosis can be established by correlating appropriate HRCT and histopathological patterns.
Given the overlapping clinical features of chronic respiratory diseases, early differential diagnosis and the identification of relatives at risk for IPF are of significant value in genetic research. A machine learning model that uses gene expression data from peripheral blood mononuclear cells to predict IPF has been developed 64. This 44-gene model can accurately predict IPF in healthy controls and patients with tuberculosis, HIV, and asthma. Moreover, the model also allows subtyping of IPF subtypes. Thus, the model shows promise as a non-invasive diagnostic tool 64. Clinical research has shown that pathogenic variations in the telomerase complex genes are found in approximately 10% of patients with IPF, regardless of family history, suggesting the need for genetic counseling for all patients with IPF 65.
When assessing lung diseases, HRCT provides finer images than traditional CT scans. However, interpreting the morphology and extent of lesions using HRCT remains a relatively subjective process that requires a solid foundation in imaging knowledge, and robust quantitative diagnostic methods are lacking. Additionally, cost and radiation exposure must also be considered. Future genetic research could complement HRCT by establishing quantitative standards for the diagnosis of IPF 58, 66. The MUC5B gene is particularly interesting in this context. Polygenic risk scores for IPF and interstitial lung anomalies have been developed using data from a GWAS in IPF, confirming the predictive value of MUC5B in identifying individuals at risk for pulmonary fibrosis 67.
5. Treatment
5.1. Drug therapy
Pirfenidone and nintedanib are anti-fibrotic drugs approved by the Food and Drug Administration (FDA) and the European Medicines Agency (EMA) for the treatment of IPF. Clinical studies have shown that pirfenidone can slow the decline in FVC 18. However, its clinical effectiveness may vary among patients with IPF and different genetic backgrounds. In particular, a subset of patients with IPF who have high expression of genes related to ciliogenic epithelial cells responds more positively to pirfenidone 68. Nintedanib is a small-molecule tyrosine kinase inhibitor that reduces lung fibroblast proliferation, migration, and differentiation by inhibiting multiple growth factor receptors. Nintedanib may slow the rate of FVC decline in patients with IPF and prolong their survival 19. In a cohort analysis, patients carrying the DSP rs2076295 G allele experienced greater benefits in overall survival and lung function when treated with nintedanib compared with TT homozygous patients 69. To the best of our knowledge, no studies have specifically examined the association between genetic differences and adverse reactions to these two anti-fibrotic drugs in patients with IPF.
Although clinical trials have confirmed the effectiveness of these drugs in slowing lung function decline 18, 19, they cannot reverse or resolve existing fibrosis. Advances in genetic research have provided new insights into the antifibrotic mechanisms of pirfenidone and nintedanib, and evidence suggests that different gene expression patterns can influence drug efficacy 70. For instance, the Toll-interacting protein (TOLLIP) rs3750920 polymorphism has been shown to modify responses to N-acetylcysteine (NAC), with TT genotype carriers deriving therapeutic benefit while CC genotype carriers may experience harm 22. Similarly, the MUC5B rs35705950 variant has been linked to differential outcomes in IPF therapies 71. These findings underscore the potential of pharmacogenomics to guide personalized treatment strategies, as exemplified by the ongoing PRECISIONS trial (NCT04300920), which stratifies IPF patients by TOLLIP genotypes to optimize NAC therapy 22. Identifying genotypes associated with drug responses could enable the prediction of individual patient responses to these drugs. Furthermore, this may facilitate the development of personalized treatment strategies, including tailored drug selection, dosage adjustment, and combination therapies.
5.2. Lung transplantation
The pharmacological treatment of patients with IPF is primarily aimed at slowing the progression of fibrosis and providing palliative care for those in advanced stages. However, despite optimal therapeutic interventions, lung function in patients with IPF may still progressively deteriorate. IPF can reach a terminal stage characterized by severely impaired lung function that is unresponsive to medical treatment. For these patients, lung transplantation (LT) represents the sole therapeutic option capable of significantly prolonging their survival and enhancing quality of life. The 5-year survival rate following lung transplantation approaches 50% 72. Nonetheless, LT faces several obstacles, including the shortage of donor lungs, stringent technical requirements, and the substantial costs associated with the procedure. Additionally, genetic factors have been shown to influence both the success and quality of LT outcomes.
The risk of complications following LT is associated with post-transplant survival and represents a crucial consideration when assessing LT candidacy 73. Clinical observations indicate that lung transplant recipients with IPF (IPF-LTRs) have circulating T cells suggestive of immunodeficiency 74, thereby increasing their risk of hematological complications 75. Moreover, IPF-LTRs are more susceptible to rare telomere-related genetic variants and shorter telomere lengths compared to non-transplant individuals 76. Some researchers have recommended incorporating genetic factors into LT evaluations, such as measuring telomere length prior to transplantation and conducting genetic testing for telomere gene variants. Such tests may help identify IPF transplant recipients who are at higher risk of hematologic complications 75.
The results of a retrospective study indicate that LT is appropriate even in patients with telomerase-related gene mutations, provided there are no myelodysplastic symptoms and a systematic hematological evaluation is performed 77. In addition, genetic factors influence decisions regarding the administration of immunosuppressants after LT. The evidence suggests that standard immunosuppressive therapy should be maintained in young transplant recipients with shorter telomeres, even if immune deficiency is present 74.
6. Prognosis
Along with FVC and DLCO, age and gender are important prognostic indicators during the stable phase of IPF. Men generally exhibit worse survival rates than women 78. The GAP staging system integrates age, gender, and lung function parameters (FVC, DLCO) to predict mortality risk, though its sensitivity for short-term (1-year) outcomes remains limited 79. Biomarkers and genetic variations provide complementary prognostic insights and may enhance existing prognostic assessment tools by addressing their limitations.
A retrospective study involving a European cohort of 1751 patients with IPF revealed that the MUC5B T allele is a significant independent predictor of patient survival. No significant correlation was observed in patients under 56 years of age, whereas in the older cohort, individuals with the T allele exhibited better survival rates 80. Conversely, in another retrospective case-control study conducted within the Portuguese population, no associations were found between MUC5B variations and disease survival rates 81. These inconsistencies likely stem from confounding factors that were not accounted for in the study design or analysis. However, the preponderance of evidence suggests that the MUC5B minor T allele is associated with improved patient survival rates 81-83, independent of age, gender, FVC, and DLCO. This paradoxical association (increased disease risk but better survival) may be influenced by index bias, as studies predominantly including prevalent rather than incident IPF cases could disproportionately select for resilient individuals with the MUC5B risk allele, thereby inflating survival estimates 84. This underscores the significance of the MUC5B genotype in a survival prognosis model for patients with IPF 83. Telomere length has been independently associated with transplant-free survival in patients with IPF, as confirmed by observational cohort studies, further highlighting its role in the prognosis of IPF 85. Ethnic-specific outcomes are evident in the prognosis of IPF patients. Rare TERT variants enriched in Latin American IPF patients correlate with aggressive disease trajectories, though mechanistic links require further study 86. Japanese patients show distinct causes of death, like acute exacerbations, and prognoses compared to other ethnic groups 87.
A comprehensive proteomic analysis of multiple IPF patient cohorts has yielded a model based on the different expression levels of osteopontin (OPN), serum protein D (SPD), intercellular adhesion molecule 1 (ICAM1), and matrix metalloproteinase 7 (MMP7), allowing robust differentiation between progressive and stable IPF 88. These circulating serum proteins are significantly associated with clinical outcomes, increased mortality rates, and greater disease severity, highlighting the feasibility of developing serology-based methods to assess IPF progression 88. For instance, baseline serum levels of cathepsin B (CTSB) strongly correlate with the extent of lung function decline at one year. Patients with elevated serum CTSB levels are more likely to exhibit a progressive IPF phenotype, irrespective of GAP stage 89. Given the growing number of potential biomarkers, developing robust methods to evaluate their clinical utility is critical. The innovative progression index offers a quantitative measure of biomarkers' influence on clinical progress 88.
Recent studies have identified polymorphisms in TGF-β_1_, proprotein convertase subtilisin/kexin type 6 (PCSK6), and protein kinase N2 (PKN2) as critical determinants of disease progression and survival outcomes in IPF. The TGF-β_1_ T869C variant has been implicated in disease severity, with the TT genotype linked to reduced PaO₂ and increased D(A-a)O₂ at diagnosis, suggesting a role in accelerating functional decline 90, 91. Similarly, the PCSK6 rs35647788 variant has been associated with reduced transplantation-free survival, potentially through dysregulated proteolytic processing of profibrotic mediators 23. The PKN2 rs115982800 variant, located in the antisense RNA PKN2-AS1, correlates with rapid FVC decline, highlighting its role in cytoskeletal remodeling and fibroblast activation 24. These findings underscore the importance of genetic variants in modulating IPF progression.
Genes associated with endoplasmic reticulum stress 92, macrophage function 93, and mitochondrial dynamics 93 are implicated in the development and progression of IPF. These genes exhibit robust associations with canonical signaling pathways, including the apoptosis signaling pathway and the PI3K/AKT pathway, which collectively modulate the pulmonary immune microenvironment. Prognostic signature genes derived from the synergistic expression of m5C-regulated genes and immune-associated genes are likely to exert significant influence over immune and inflammatory responses, enabling precise prediction of survival outcomes in IPF patients 94. Continued advancements in genetic research are anticipated to unveil novel genetic determinants, thereby facilitating the development of refined prognostic models and elucidating the underlying mechanisms of IPF pathogenesis.
7. Genetic-Based Diagnosis and Treatment Models
Genetic testing serves principally as an auxiliary tool within the classical diagnostic and therapeutic strategies for IPF. Although genetic testing exhibits limited independent utility for IPF diagnosis, treatment, or prognosis, its integration as a central analytical framework in clinical practice remains feasible. This strategy is particularly advantageous for patient subgroups at elevated risk of disease progression and adverse outcomes, as it enables the customization of therapeutic strategies according to individual genetic profiles.
Genetic studies have delineated distinct patient subgroups for IPF diagnosis that can be used to identify high-risk populations. The 2022 ATS/ERS/JRS/ALAT guidelines emphasize the significance of genetic factors in stratifying the clinical severity of IPF and discuss their clinical applications. Despite guideline recommendations against using genomic classifiers as a standard diagnostic tool for usual interstitial pneumonia due to a lack of consensus 10, genetic factors remain essential in the personalized treatment strategies of precision medicine. As advancements in genetic etiology and cost-effective genetic testing continue, identifying patients who would benefit from genetic testing will grow increasingly critical. This will aid in developing intervention strategies to slow disease progression and selecting the most appropriate screening and management protocols (Figure 3). The Envision Genomic Classifier for IPF diagnosis, a clinically validated tool derived from whole transcriptome mRNA sequencing in transbronchial biopsy samples, integrates clinical factors and HRCT imaging, demonstrating high diagnostic sensitivity 95. Moreover, detecting the differential expression of the pirfenidone response gene across IPF subgroups is important, and sophisticated machine learning techniques can facilitate the development of classifiers that reflect cell-type characteristics and gene expression patterns 68.
Genetic variations in patients with IPF influence their responsiveness to pharmacotherapies, necessitating the integration of genetic insights into drug development (Figure 3). Modern genomics offers precise molecular targets for disease diagnosis and therapeutic intervention. Integrating gene expression profiles and pathological characteristics of patients with IPF into computational methods can expedite drug development 96. Artificial intelligence platforms that identify drug targets optimize the drug development process by streamlining target discovery 97. For instance, computational simulations have demonstrated that a plant-derived microRNA, osa-miR172d-5p, downregulates the expression of TAK1-binding protein 1 and fibrosis-related genes in TGF-β-stimulated pulmonary fibroblasts 98.
In murine studies, gene therapy has demonstrated the potential to arrest the progression of pulmonary fibrosis. Utilizing an adeno-associated virus serotype 9 (AAV9)-Tert vector for gene therapy, reactivation of telomerase in the lung can delay disease progression in murine models of pulmonary fibrosis 99. Prophylactic intratracheal administration of AAV9-Tspyl2 delays the onset of bleomycin-induced pulmonary fibrosis in mice by inhibiting the TGF-β/Smad3 signaling pathway 100. BIX01294, an exceptionally selective G9a histone methyltransferase inhibitor, reduces TGF-β-induced H3K9 methylation and matrix stiffness via upregulation of the PPARGC1A gene 101, thereby diminishing collagen deposition in the lungs of mice following bleomycin injury. These results suggest that therapeutic interventions targeting epigenetic repression mechanisms hold promise. Studies on the relaxin/RXFP1 axis 102 and microRNA-144-3p 103 have revealed their promising anti-fibrotic properties in patients with IPF, offering new avenues for IPF therapeutics.
The identification of genetic factors enables the stratification of IPF cases, the correlation of genetic and phenotypic profiles, and prediction of treatment effectiveness and patient outcomes (Figure 3). The molecular signatures of IPF-associated fibroblastic subtypes and their prognostic implications have been characterized through machine learning and single-cell analyses 104. New risk assessment models can be developed using bioinformatics and machine learning algorithms to facilitate the stratification of patient subgroups and refine personalized therapeutic strategies.
8. Discussion
Genetic factors significantly contribute to the epidemiological variance observed in IPF and are significant to its pathogenesis. The incidence and clinical outcomes of IPF exhibit variability across ethnicities and regions, primarily due to genetic predispositions. For instance, in Asian and North American populations, the minor allele frequency of the T allele in the MUC5B gene rs35705950 is significantly correlated with the IPF incidence. However, the lower incidence of IPF in the European population, despite a higher frequency of the minor T allele, suggests that additional factors affect disease development. In addition to genetic factors, environmental determinants, including occupational exposure and gender differences, have been implicated in the occurrence and progression of IPF (Figure 3). A broad consensus exists that environmental and host factors exert cumulative effects on IPF risk 105, and family history research offers an opportunity to elucidate the genetic contributions to IPF.
Family studies have revealed the so-called anticipation phenomenon, characterized by an earlier onset of pulmonary fibrosis symptoms in successive generations of families with TERT mutations. This phenomenon correlates with shorter telomeres in the offspring of families with telomerase mutations, which have been validated as a risk factor for IPF and are associated with adverse clinical outcomes 106. Although anticipation has not been observed for other genetic factors implicated in the etiology and progression of IPF (such as MUC5B, DSP, and surfactant proteins), evidence suggests potential interactions among these factors 7, 33. Epigenetic mechanisms may act both as independent contributors to IPF 36 and as mediators in its pathogenesis 36, 39. A bioinformatics-driven network of genetic interactions may systematically elucidate the role of genetic factors in IPF pathogenesis.
Genetic variations are important factors in the lung function and radiological features of patients with IPF, influencing the spectrum and severity of clinical phenotypes. Most patients with IPF experience a gradual decline in clinical, functional, and radiographic status, but some undergo acute respiratory exacerbations (AE-IPF). Research indicates that shortened telomere length is associated with an increased risk of AE-IPF or mortality in these patients 107, and that expression of the S100A8/A12 genes is linked to the etiology of AE-IPF 108. Genetic factors provide important insights into the stratification of clinical severity in patients with IPF, as acknowledged in the ATS/ERS/JRS/ALAT clinical guidelines, which recognize the clinical application of genetic classifiers 10. Physicians are encouraged to thoroughly investigate the optimal application of genetic factors to enhance disease diagnosis and treatment, in accordance with current clinical guidelines.
Genetic imaging, a multidisciplinary field integrating genetics and neuroimaging, assesses changes in brain morphology and function to determine the impact of genetic variations on individual behaviors and diseases. Imaging techniques have identified links between genetic variations and brain structures, for example, a correlation has been found between the SNP rs42352 in the Semaphorin 5A (SEMA5A) gene and bilateral hippocampal volume 109. Incorporating imaging data to examine associations between genetic variations and disease enhances diagnostic confidence and demonstrates the viability of a new diagnostic paradigm that leverages genetic factors to interpret imaging data and clinical phenotypes. This approach is particularly relevant in the IPF field, which relies heavily on diagnostic imaging tools (Figure 3). Studying imaging variants caused by genetic factors can highlight the role of imaging data in tracking disease progression. In addition, genetic factors can serve as quantifiable benchmarks and provide complementary assessment support in challenging imaging diagnoses.
In the realm of pharmacotherapy for IPF, FDA-approved treatments such as pirfenidone and nintedanib lack evidence of reversing pulmonary fibrosis, underscoring significant market opportunities for the development of novel IPF therapies. Understanding the genetic factors involved in IPF pathogenesis provides a theoretical basis for leveraging gene therapy to treat the disease. For example, Luxturna, the groundbreaking in vivo gene therapy for hereditary retinal dystrophy, transfers the RPE65 gene to retinal cells using an AAV vector, and its successful approval confirmed the feasibility of gene therapy 110. While experimental data from clinical trials are limited, AAV-based in vivo gene therapy has demonstrated the potential to halt the progression of pulmonary fibrosis in murine models, representing a promising avenue for IPF treatment.
Prognostic assessments significantly influence physicians' clinical decision-making. In the context of IPF, a disease characterized by complex genetic mechanisms and diverse clinical manifestations, machine learning and bioinformatics may be leveraged to develop personalized prognostic models to overcome the limitations of the GAP tier system in short-term risk prediction. The identification of additional biomarkers that can predict disease progression would be clinically useful, enabling timely adjustments to treatment protocols in response to individual patient progression. Current biomarkers exhibit variable reliability for predicting IPF progression, and identifying robust markers remains a challenge. Computer-assisted techniques that assign weights to each marker in disease progression could streamline the development of future clinical guidelines and facilitate the integration of new biomarkers.
8.1. Limitations
This review has several limitations. First, it is not a systematic review and has not undergone a formal quality assessment. Second, the articles included are limited to English-language publications. Third, we focused on describing the role of certain genetic factors in IPF, although some pathogenic mechanisms have not been studied in detail. Fourth, relevant articles may have been overlooked.
9. Conclusions
Genetic factors are of utmost importance in the etiology and progression of IPF. They significantly influence the epidemiological profile of IPF and its pathogenesis, clinical presentation, and prognostic outcomes. Current genetic research is poised to refine existing diagnostic frameworks and provide essential quantitative benchmarks for the development of personalized therapeutic strategies. By recognizing the importance of genetic factors and refining diagnostic and therapeutic methods tailored to these influences, it will be possible to better categorize therapeutic interventions and improve the accuracy of prognostic assessments. This approach aligns with the emerging paradigm of precision medicine.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Seibold MA Wise AL Speer MC Steele MP Brown KK Loyd JEA common MUC 5B promoter polymorphism and pulmonary fibrosis N Engl J Med 2011364150312 doi:10.1056/NEJ Moa 10136602150674110.1056/NEJ Moa 1013660 PMC 3379886 · doi ↗ · pubmed ↗
- 2Chin D Hernandez-Beeftink T Donoghue L Guillen-Uio B Leavy OC Adegunsoye A Genome-wide association study of Idiopathic Pulmonary Fibrosis susceptibility using clinically-curated European-ancestry datasetsmed Rxiv 2025 doi:10.1101/2025.01.30.25321017
- 3Allen RJ Stockwell A Oldham JM Guillen-Guio B Schwartz DA Maher TM Genome-wide association study across five cohorts identifies five novel loci associated with idiopathic pulmonary fibrosis Thorax 20227782933 doi:10.1136/thoraxjnl-2021-2185773568862510.1136/thoraxjnl-2021-218577 PMC 9329250 · doi ↗ · pubmed ↗
- 4Hao Y Bates S Mou H Yun JH Pham B Liu J Genome-Wide Association Study: Functional Variant rs 2076295 Regulates Desmoplakin Expression in Airway Epithelial Cells American Journal of Respiratory and Critical Care Medicine 2020202122536 doi:10.1164/rccm.201910-1958 OC 3255179910.1164/rccm.201910-1958 OCPMC 7605184 · doi ↗ · pubmed ↗
- 5Peljto AL Blumhagen RZ Walts AD Cardwell J Powers J Corte TJ Idiopathic Pulmonary Fibrosis Is Associated with Common Genetic Variants and Limited Rare Variants American journal of respiratory and critical care medicine 20232071194202 doi:10.1164/rccm.202207-1331 OC 3660284510.1164/rccm.202207-1331 OCPMC 10161752 · doi ↗ · pubmed ↗
- 6Moore C Blumhagen RZ Yang IV Walts A Powers J Walker T Resequencing Study Confirms That Host Defense and Cell Senescence Gene Variants Contribute to the Risk of Idiopathic Pulmonary Fibrosis American Journal of Respiratory and Critical Care Medicine 2019200199208 doi:10.1164/rccm.201810-1891 OC 3103427910.1164/rccm.201810-1891 OCPMC 6635791 · doi ↗ · pubmed ↗
- 7Dressen A Abbas AR Cabanski C Reeder J Ramalingam TR Neighbors M Analysis of protein-altering variants in telomerase genes and their association with MUC 5B common variant status in patients with idiopathic pulmonary fibrosis: a candidate gene sequencing study The Lancet Respiratory Medicine 2018660314 doi:10.1016/S 2213-2600(18)30135-82989135610.1016/S 2213-2600(18)30135-8PMC 6487850 · doi ↗ · pubmed ↗
- 8Dhindsa RS Mattsson J Nag A Wang Q Wain LV Allen R Identification of a missense variant in SPDL 1 associated with idiopathic pulmonary fibrosis Commun Biol 20214392 doi:10.1038/s 42003-021-01910-y 3375829910.1038/s 42003-021-01910-y PMC 7988141 · doi ↗ · pubmed ↗
