The Role of COL1A1, COL5A1, ACTN3, MMP3, and GDF5 Gene Variants in Common Sports Injuries: Systematic Review of ACL Rupture, Achilles Tendinopathy, and Stress Fractures
Shahd Abboud, Elizabeth Akam, David John Hunter, Sarabjit Mastana

TL;DR
This study reviews how specific gene variants may increase the risk of common sports injuries like ACL rupture and Achilles tendinopathy, suggesting a genetic component in injury susceptibility.
Contribution
A systematic review identifying genetic variants in COL1A1, COL5A1, ACTN3, MMP3, and GDF5 associated with musculoskeletal sports injuries.
Findings
COL1A1 rs1800012 may protect against ACL rupture.
COL5A1 rs1272 and rs13946 increase risk for ACL rupture and Achilles tendinopathy.
ACTN3 R577X XX genotype is linked to higher muscle injury risk.
Abstract
Background: Anterior cruciate ligament (ACL) rupture, Achilles tendinopathy, and stress fracture are common sports injuries with significant long-term effects on performance and health. Despite similar exposure, injury susceptibility varies among athletes, suggesting a genetic component. Variants in COL1A1, COL5A1, ACTN3, MMP3, and GDF5 genes influence collagen integrity, muscle performance, and extracellular matrix remodelling, making them potential risk factors. Objective: To systematically review associations between five selected genes and musculoskeletal injury risk. Methods: Following PRISMA 2020 guidelines, PubMed, EMBASE, SPORTDiscus, and Web of Science were searched for studies examining these genes in relation to sports injuries. Data were extracted using Covidence and assessed for quality via the Newcastle–Ottawa Scale (NOS). Results: Twenty-six studies (n > 7000) were…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTendon Structure and Treatment · Genetics and Physical Performance · Sports injuries and prevention
1. Introduction
Sport-related musculoskeletal injuries are a major concern across all levels of athletic participation, causing a substantial burden on performance, career longevity, and long-term health. High-impact injuries such as anterior cruciate ligament (ACL) ruptures, Achilles tendinopathy, and stress fractures are very common and can lead to lengthy rehabilitation, impaired athletic performance, or even premature retirement from sport [1,2,3]. Traditionally, injury prevention initiatives have focused on identifying extrinsic and intrinsic factors that contribute to an athlete’s risk profile [3,4,5,6]. Extrinsic factors encompass external conditions encountered during training or competition, including abrupt increases in training volume or intensity, playing surface characteristics, inappropriate footwear, and environmental conditions. These factors directly influence mechanical loading patterns and may alter movement biomechanics, particularly under fatigue [5]. Conversely, intrinsic factors reflect individual characteristics such as biomechanical alignment, muscular strength deficits, joint laxity, and neuromuscular control. Impairments in proprioception or delayed muscle activation can compromise protective reflexes around joints, thereby increasing susceptibility to injury. A prior history of musculoskeletal injury constitutes a critical intrinsic risk factor, as residual weakness, altered movement mechanics, and incomplete tissue recovery may predispose athletes to reinjury [6]. Although these traditional risk factors are well established, they do not fully explain the pronounced inter-individual variability in injury susceptibility observed among athletes. Some individuals appear inherently more “injury-prone” despite optimal conditioning and technique, while others remain relatively “injury-resistant” even under comparable biomechanical and environmental demands. This variability has contributed to growing interest in the role of genetic predisposition in determining sports injury risk [7,8,9,10,11].
The rapid evolution of sports genomics over the past two decades—driven by advances in molecular genetics, genome-wide association studies (GWAS), and high-throughput genotyping technologies—has enabled researchers to identify genetic variants associated with both performance characteristics and musculoskeletal injury susceptibility [8,9,10]. Of particular interest are functional polymorphisms, which represent common variations in the DNA sequence that may alter protein structure or expression. These variants can influence key biological processes, including collagen synthesis and organisation, tendon elasticity, muscle fibre function, and inflammatory responses, ultimately shaping tissue resilience, mechanical integrity, and recovery capacity [10]. Understanding these genetic contributions may help explain why athletes with similar training histories and biomechanical profiles encounter different injury outcomes. Anterior cruciate ligament (ACL) rupture, Achilles tendinopathy and stress fracture are the most common injuries. Each is influenced by a range of mechanical, anatomical, and environmental factors [1,2,3,4,5,6,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26].
Table 1 summarises the typical injury mechanisms and evidence-based risk factors for the three focal injuries of this review, along with the genetic markers identified.
Genetic variants that influence tissue repair, collagen structure, tendon stiffness, or inflammatory signalling may interact with behavioural and environmental factors—such as training load or nutritional status—to shape an athlete’s injury profile [27,28,29]. These interactions likely contribute to the heterogeneity in injury outcomes even among athletes with similar exposure levels. This systematic review focuses on five key genetic polymorphisms (COL1A1, COL5A1, ACTN3, GDF5, and MMP3) that have shown significant associations with musculoskeletal sport injuries. Each of these genes plays a distinct role in the structure or function of connective tissues and muscle, and thus each may contribute to injury risk in different ways [9,10,11]:
- •COL1A1 (Collagen Type I Alpha 1 Chain): Encodes a subunit of type I collagen, the primary structural protein found in ligaments, tendons, and bone. A well-studied variant in this gene (the rs1800012 polymorphism in the Sp1-binding site) alters collagen fibril properties, which can reduce tissue tensile strength. As a result, this variant has been associated with an increased risk of soft tissue injuries, such as ligament tears and tendon ruptures [16,17].
- •COL5A1 (Collagen Type V Alpha 1 Chain): This gene encodes a component of type V collagen, which serves as a regulator of collagen fibril formation and organisation in connective tissue. The rs12722 polymorphism of COL5A1 has been associated with variability in tissue flexibility and stiffness, and it is known to affect one’s susceptibility to tendinopathy and ligament injuries. In particular, this polymorphism is associated with a greater prevalence of Achilles tendinopathy and ACL ruptures, suggesting that collagen V reflects a person’s ability for structural tissue integrity and resilience to injury in tendons and ligaments [30].
- •ACTN3 (Alpha-Actinin-3): Encodes α-actinin-3, a protein predominantly present in fast-twitch (type II) muscle fibres, which serve an important role in the performance of fast, explosive movements. A common polymorphism in the gene, R577X, either permits the presence (R allele) or absence (X allele) of the functional protein α-actinin-3. The absence of α-actinin-3 (XX genotype) changes muscle fibre composition and may compromise power output and fatigue resistance. Genotype XX has been associated with increased risk for specific musculoskeletal injuries, particularly those incurred while participating in sports that require powerful sprinting, jumping, or rapid directional changes, in which protection by fast-twitch fibres is critical [31].
- •GDF5 (Growth Differentiation Factor 5): Encodes a growth factor that plays an important role in joint development, maintenance of cartilage, and tissue repair. Polymorphisms in GDF5 (e.g., rs143383) reduce the production of growth factor and impede cartilage formation and repair. The genetic variant is well characterised for its association with increased risk of osteoarthritis in weight-bearing joints, which leads to the inference that there is a higher risk of joint injuries and increased recovery time from tissue damage [32].
- •MMP3 (Matrix Metalloproteinase-3): Encodes an enzyme that degrades extracellular matrix components and is active in tissue remodelling and repair after injury. A functional polymorphism in the MMP3 gene promoter (commonly referred to as the 5A/6A polymorphism) influences the level of MMP3 expression. The high-activity variant can lead to more aggressive matrix breakdown, whereas the low-activity variant might slow tissue remodelling. These differences in MMP3 activity have been linked to variability in recovery rates and injury severity—for example, certain alleles are associated with a greater risk of tendinopathies and ligament damage due to either insufficient repair or excessive degradation of collagen fibres [33].
This systematic review aims not only to summarise the evidence but also to explore the potential for clinical applications in injury prevention. The review seeks to contribute to the advancement of personalised sport genomics and the development of targeted strategies to reduce injury risk among athletes.
2. Materials and Methods
Study Design: This systematic review study was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 guidelines [34].
Studies were selected based on predefined inclusion and exclusion criteria structured using the PICOS framework:
Inclusion Criteria
Population: Athletes of all ages, sexes, and competition levels, and physically active individuals.
Intervention/Exposure: Genetic polymorphisms (COL1A1, COL5A1, ACTN3, GDF5, or MMP3) or genome-wide signals assessed in relation to athletic performance, physiological traits, or injury outcomes.
Comparator: Athletes with different genotypes, or athletes compared with non-athlete control populations when relevant.
Outcomes: Injury incidence, severity, or recovery.
Study Designs: Observational studies (case–control and cohort), candidate-gene studies, and genome-wide association studies (GWAS) and meta-analyses.
Exclusion criteria included: (1) studies not involving athletes or sports injuries; (2) studies not involving human genetic data; (3) reviews, commentaries, editorials, or conference abstracts without primary data; (4) animal or in vitro studies; and (5) studies lacking extractable results.
Data Sources and Search Strategy: PubMed, EMBASE, SPORTDiscus (EBSCOhost), and Web of Science were searched using Boolean combinations of gene symbols and injury terms (e.g., “COL5A1” AND “Achilles” OR “tendinopathy”; “COL1A1” AND “ACL”; “ACTN3” AND “injury”). Reference lists and the relevant grey literature were screened. The time scale for this review was limited to the last 20 years (until 1 August 2025).
Study Selection: All search results were imported into Covidence systematic review software 2025 [35], and duplicates were removed. Titles and abstracts were screened against the eligibility criteria by two reviewers. Articles deemed potentially relevant were retrieved in full and assessed independently by two reviewers. Disagreements were resolved through discussion. Following the PRISMA flow diagram (Figure 1), 26 studies were identified that met the eligibility criteria.
Data Extraction: Data from included studies were extracted using a structured framework template on Covidence. A standardised extraction template was developed to ensure consistency across all records. For each eligible study, the following information was systematically recorded: study characteristics (first author, year of publication, country, and study design), population details (athlete status, sample size, and sex), genetic markers (specific genes and single-nucleotide polymorphism investigated), injury focus (type of musculoskeletal injury), outcomes (association between genetic variants and injury risk), and main results. The summary of all studies is presented in Table 2.
Quality Assessment: The Newcastle–Ottawa Scale (NOS) [56] was applied to assess the risk of bias. The NOS evaluates studies across three domains, selection of study groups, comparability of cases and controls or cohorts, and outcome/exposure assessment, with a maximum possible score of nine stars. Higher scores indicate a lower risk of bias. Studies were assessed according to their design using either the case–control or cohort version of the NOS. Each study was checked individually, and if it met the NOS criteria, it was awarded a star. The results of the quality assessment are presented in separate tables for cohort (Table 3) and case–control studies (Table 4). McCabe et al.’s 2018 study [50] is a cross-sectional study; therefore, its NOS rating is very low.
3. Results
A total of 26 studies met the inclusion criteria, as shown in the PRISMA flow diagram (Figure 1). These comprised case–control and cohort designs and involved physically active individuals or athletes across a broad range of sports, including football, rugby, athletics, and mixed sporting disciplines (see Table 2). The samples represented diverse populations from Europe, Asia, Australia, South America, and Africa. Among all investigated variants, polymorphisms within collagen-encoding genes (COL1A1, COL1A2, and COL5A1) were the most frequently examined. By contrast, despite its biological relevance, relatively few studies have focused on GDF5.
Given heterogeneity across study designs, genetic markers assessed, and injury outcomes, a qualitative synthesis was undertaken. Findings are presented across three primary categories: (1) anterior cruciate ligament (ACL) rupture, (2) Achilles tendinopathy and rupture, and (3) stress fractures and bone-related injuries. A small number of studies examined broader soft tissue or muscle injuries; these are summarised separately. Within each category, results are synthesised on an individual gene/polymorphism basis. Where multiple studies examined the same polymorphism or gene, patterns of association and consistency of evidence are reported. A meta-analysis was not performed due to variability in study design, outcome measures, and reporting standards.
3.1. ACL-Related Genetic Association Studies
Several genetic variants have been associated with ACL rupture susceptibility, but the evidence has overwhelmingly implicated collagen genes (COL1A1, COL1A2, and COL5A1) and matrix metalloproteinases (MMP3).
3.1.1. COL1A1
Several studies have examined the functional rs1800012 polymorphism in the COL1A1 Sp1-binding site. Posthumus et al. [16] reported that the TT genotype was completely absent among 117 South African ACL rupture cases, suggesting a strong protective effect. Collins et al. [55] later corroborated this in a pooled analysis of independent cohorts from South Africa and Sweden. In their dataset—including 350 cruciate ligament ruptures and 581 controls—the rs1800012 TT genotype was rare overall but markedly underrepresented in injured groups. Only 0.3% of cruciate rupture cases carried the TT genotype compared with 4.1% of controls (p = 0.0002), reinforcing its protective role. Perini et al. [45] investigated COL1A2 variants in 146 ACL rupture cases and 192 controls across multiple sports. They found a strong association with rs42524 and rs2621215: the CC genotype of rs42524 had a six-fold risk increase, while the GG genotype of rs2621215 had a four-fold risk increase. More importantly, the presence of COL1A1 rs1107946 in the GT or TT genotype with COL1A2 wildtype alleles had a protective effect (OR = 0.25).
3.1.2. COL5A1
Posthumus et al. [41] demonstrated a sex-specific effect of the rs12722 polymorphism, with the CC genotype underrepresented (and thus protective) in female ACL cases, while the TT genotype was enriched. O’Connell et al. [44] extended these findings in a large multi-population case–control study involving 333 ACL rupture cases and 378 active controls from South Africa and Poland. rs12722 in COL5A1 was independently associated with the risk of ACL rupture in female athletes, and the haplotype based on the combination of alleles at the rs12722 of COL5A1 and rs970547 of COL12A1 was significantly overrepresented in female cases, thus underscoring the importance of sex-specific gene–gene interactions. Lulińska-Kuklik and colleagues [17] further examined COL5A1 variants in 134 Polish professional male footballers with surgically confirmed ACL rupture compared to 211 healthy controls. They found no differences for rs12722 or rs13945, but the rs13946 variant showed a significant association in a dominant mode of inheritance (p = 0.039). More recently, Rodas et al. [54] analysed 46 elite footballers at FC Barcelona and showed that the rs13946 CC genotype was significantly more frequent in female players with ACL rupture, reinforcing sex-specific susceptibility.
3.1.3. MMP3
Two studies evaluated the role of matrix metalloproteinase-3 (MMP3) in ACL injuries. Simunić-Briški et al. [47] investigated 95 ACL rupture cases and 92 controls, reporting that risk genotypes rs591058 TT, rs650108 GG, and rs679620 AA were significantly overrepresented in cases. Haplotype analysis identified T–G–A as the risk holotype and C–A–G was protective. Conversely, Malila et al. 2011 [49] examined 86 ACL cases and 100 controls in a Thai cohort, finding no overall effect of the −1612 5A/6A polymorphism, though a subgroup analysis revealed that the 5A+ genotype was more frequent in contact injury cases.
3.2. Achilles Tendinopathy (AT)/Rupture Studies
3.2.1. COL5A1
Evidence implicates COL5A1 in chronic Achilles pathology. September et al. [21] assessed two independent cohorts (South Africa 93 AT vs. 132 controls; Australia 85 AT vs. 210 controls) for several COL5A1 variants (including rs12722, rs3196378, rs13946, and others). The rs12722 CC genotype was protective for tendinopathy in both cohorts (AUS OR ≈ 0.42; SA OR ≈ 0.38). In the Australian sample, rs3196378 AC elevated the risk of tendinopathy. Brown et al. [41] (UK case–control study; 112 AT, 227 controls) investigated COL5A1 rs12722/rs3196378 along with other ECM (extracellular matrix) genes; they reported that the C allele rs12722 was protective for AT, while other variants in the ECM increased risk.
3.2.2. MMP3
MMP3 variants have shown consistent associations with chronic Achilles pathology. Raleigh et al. [22] reported elevated tendinopathy risk among individuals carrying rs679620 GG, rs591058 CC, or rs650108 AA genotypes, and identified a protective ATG haplotype. Briski and colleagues [46] demonstrated a similar pattern in a Croatian athlete sample (63 AT vs. 92 retired athlete controls), where rs650108 GG (OR = 2.46) and rs679620 AA (OR = 3.14) were significantly overrepresented in cases. Recent findings from Brazier et al. [43] extended the evidence base by analysing elite male rugby athletes. This case–control study identified key roles for both MMP3 rs679620 and COL5A1 polymorphisms in soft tissue injury susceptibility. The TT genotype of MMP3 rs679620 was significantly overrepresented in players with ligament ruptures and sprains. The T allele appeared protective against tendinopathy. For COL5A1, carriers of the C allele at rs12722 were markedly more common in the tendon rupture group compared to non-injured athletes, conferring an eight-fold increased risk. When COL5A1 rs12722 and rs3196378 were analysed together, the T-C haplotype was enriched in tendon rupture, ligament sprain, and overall injured groups.
3.3. Stress Fracture Studies
3.3.1. COL1A1
Miyamoto-Mikami et al. [57] analysed a Japanese sample (Stage 1 cross-sectional n = 1667; Stage 2 prospective n = 508) and reported the differential role of rs1107946; the C allele showed female-specific antagonistic effects: higher fatigue fracture risk (OR ≈ 2.4) but lower muscle injury risk (OR ≈ 0.46). Mechanistic profiling suggested that this allele may lower bone mineral density (BMD), reduce muscle–tendon stiffness, and shift collagen composition toward a greater α1-chain homotrimer ratio, indicating a potential bone–muscle trade-off.
3.3.2. COL5A1
Evidence from Jacob et al. 2022 [51] shows that the COL5A1 rs12722 TT genotype increases the risk of bone and stress injuries in elite Australian Football League athletes (7-season prospective AFL cohort; n = 46 with 992 injuries). Varamenti et al. [52] further reported that rs12722 TT was protective for tendon and muscle injuries, emphasising the complex role of type V collagen across tissue types.
3.3.3. GDF5
Zhao et al. [25] conducted a prospective cohort study among 1398 Chinese male infantry recruits undergoing eight weeks of basic training. The study reported a 13.5% incidence of radiologically confirmed stress fractures. In addition to established risk factors such as prior fracture history and lower pre-training exercise level, a significant genetic association was observed for the GDF5 rs143383 polymorphism. The T allele was more prevalent among cases (79.1% vs. 68.4% in controls), conferring a 1.75-fold increased risk (95% CI: 1.35-2.28).
3.4. Broader Muscle and Soft Tissue Injuries
Five studies [36,37,38,39,40] from different parts of the world (Brazil, Italy, Spain and Sweden examined the role of the ACTN3 polymorphism in soft tissue injuries, and in most studies, XX genotype carriers documented a higher risk of injury. The sample sizes are relatively small; therefore, caution is warranted.
4. Discussion
This review explored the role of genetic variation in musculoskeletal injury susceptibility alongside established biomechanical and training-related factors. Collagen-related loci (COL1A1/COL1A2/COL5A1) affect fibril structure and tissue mechanics, modulating ligament and tendon resilience. MMP3 variants may alter extracellular matrix turnover, influencing repair dynamics after microtrauma. ACTN3 loss of function (XX genotype) is consistently linked to higher non-contact muscle and soft tissue injury burden, plausibly via reduced fast-twitch function. GDF5 variation may contribute to bone stress responses. Overall, the results lend support to the notion that genetic variation is an important determinant of injury susceptibility alongside extrinsic–intrinsic risk factors, including, but not limited to, biomechanics, training loads and exposure to surfaces. It is noteworthy that many of the associations noted were sex-specific, context-dependent or moderated by gene–gene interactions, which reflects the complex polygenic nature of risk of musculoskeletal injury. However, heterogeneity, modest sample sizes, and limited representation of non-Caucasian cohorts constrain inference. Functional validation and broader approaches—polygenic risk scores, GWAS, and gene–environment analyses—are needed to refine effect estimates and translation to practice.
4.1. Interpretation of Key Findings of ACL Rupture
The COL1A1 gene encodes a subunit of type 1 collagen, which is the primary structural protein in ligaments, tendons, and bone, critical for tissue tensile strength [16]. Variants in this gene can alter collagen transcription, protein structure, or post-translational modification, influencing the ligament’s mechanical structure and resilience. This review highlights a protective effect of the COL1A1 rs1800012 TT genotype against ACL rupture. Studies have reported this genotype to be either absent or significantly underrepresented in individuals with ACL ruptures compared to controls [45,55]. This strong association suggests that individuals carrying the TT genotype may have altered type 1 collagen structure or expression that enhances ligament resistance, potentially making them less susceptible to rupture [43]. Beyond direct effects, gene–gene interactions are also crucial; for instance, the COL1A1 rs1107946 GT or TT genotype conferred a protective effect when combined with COL1A2 wildtype alleles, suggesting that the overall integrity and resilience of the ACL’s complex collagenous extracellular matrix is modulated by a combination of genetic factors [45].
The COL5A1 gene encodes a component of type V collagen, which plays a fundamental role in regulating collagen fibril formation and organisation within connective tissues [30]. Polymorphisms in COL5A1, particularly rs12722 and rs13946, are known to influence tissue flexibility and stiffness, critical for ligament function and injury resistance [30]. Consistent findings in several studies link these two variants with increased risk for ACL rupture, often demonstrating sex-specific effects [17,40,41]. The rs12722 TT genotype was frequently overrepresented in female ACL rupture cases, and the rs13946 CC genotype was significantly more common in female elite footballers with ACL injuries [40]. The physiological implication is that variations in COL5A1 can lead to altered collagen fibril diameter and packing, resulting in ligaments that may be either too stiff or too lax. This can compromise the ACL’s ability to effectively absorb and dissipate mechanical stress, thereby increasing susceptibility to rupture, particularly in female athletes, where additional biomechanical and hormonal factors may interact with these genetic predispositions [40]. A recent meta-analysis by our group [58] showed a significant effect of rs12722 on lower limb musculoskeletal injuries in the recessive model of inheritance (OR = 1.28 (CI 1.03–1.57, p = 0.021) and ACL (OR = 1.31 (CI 1.03–1.67, p = 0.026).
MMP3 (matrix metalloproteinase-3 gene) encodes an enzyme vital for the degradation and remodelling of extracellular matrix components, a process crucial for tissue repair and adaptation [33]. Studies have shown variable associations between MMP3 polymorphisms and ACL rupture risk. Specific genotypes (rs591058 TT, rs650108 GG, and rs679620 AA) and a T-G-A risk haplotype were significantly overrepresented in non-contact ACL rupture cases [47]. This suggests that genetic variations leading to dysregulated MMP3 activity can disturb the delicate balance of collagen synthesis and degradation within the ACL. Excessive MMP3 activity could lead to accelerated breakdown of the collagen matrix, weakening the ligament, whereas insufficient activity might impair proper remodelling and repair following microtrauma [33].
4.2. Achilles Tendinopathy/Rupture
Evidence from this review consistently implicates COL5A1 and MMP3 genes in susceptibility to Achilles tendinopathy and rupture. COL5A1 plays a fundamental role in regulating collagen fibril organisation and overall tissue mechanics. Dysregulation of type V collagen can lead to altered tissue properties, making tendons more susceptible to repetitive strain and microtrauma [30]. The review indicates that the COL5A1 rs12722 CC genotype is consistently protective against Achilles tendinopathy, suggesting that this variant may contribute to a more resilient tendon structure capable of withstanding chronic loading [21,41]. However, the role of COL5A1 is complex and potentially injury-specific because the rs12722 C allele was also reported to be associated with a markedly higher risk of acute tendon rupture in elite male rugby players [42]. Physiologically, these findings suggest that while certain COL5A1 variants might optimise the collagen fibril structure in a way that protects against the chronic degenerative characteristics of tendinopathy, different genetic alterations or the same allele in a different context might predispose the tendon to acute failure under extreme, sudden loads by affecting its elastic limits or ultimate tensile strength.
The role of MMP3 in extracellular matrix (ECM) degradation and remodelling is crucial for Achilles tendon health, as proper repair and adaptation are essential to withstand high mechanical loads. A few studies show that specific MMP3 risk genotypes (e.g., rs679620 GG/AA, rs591058 CC, and rs650108 AA) [22,46,48] are associated with an increased risk of Achilles tendinopathy, often with synergistic effects when interacting with COL5A1 variants [22]. Physiologically, these variants likely lead to a state of chronic, imbalanced extracellular matrix turnover, where excessive collagen breakdown or impaired repair capacity weakens the tendon, making it vulnerable to repetitive microtrauma and the development of tendinopathy. Conversely, one study reported that the T allele of MMP3 rs679620 is protective against tendinopathy, despite its TT genotype being linked to higher ligament injury risk [42]. This complex pattern suggests that MMP3’s influence is highly dependent on the specific variant, the type of tissue, and the exact nature of the injury (chronic degeneration versus acute rupture), reflecting its diverse roles in tissue homeostasis and repair.
4.3. Stress Fracture
Relatively few studies met the inclusion criteria examining stress fractures in athletes and the specific genes reviewed here, especially compared with the larger body of research on the ACL and Achilles. This makes it difficult to draw firm conclusions, but the available evidence highlights potential roles for COL1A1, COL5A1, and GDF5.
As the primary structural protein of bone, COL1A1 is central to bone strength and elasticity. The rs1107946 C allele shows a striking female-specific effect: increasing fatigue fracture risk while reducing muscle injury risk. This allele has been linked to lower bone mineral density and an altered collagen composition (more α1 homotrimers), making bone more vulnerable to microdamage under repetitive loading [26]. These findings suggest a possible bone–muscle trade-off, where genetic adaptations that benefit one tissue may compromise another tissue.
COL5A1, although known for its role in collagen fibril organisation, also appears relevant to bone integrity. The rs12722 TT genotype was associated with a higher incidence of bone injuries, including stress fractures, in elite athletes [52,58]. This suggests that variants in COL5A1 may reduce the bone matrix’s ability to withstand cumulative stress. However, this was a pilot study with a very small sample size, which limits the strength and generalizability of the conclusion.
Finally, GDF5, a key regulator of cartilage and bone development, has limited but notable evidence linking it to stress fractures. A large prospective cohort study of Chinese infantry recruits reported that the rs143383 T allele increased stress fracture risk by 1.75-fold, likely reducing GDF5 production and impairing bone remodelling [25]. Together, these findings suggest a potential role for genetic variation in bone stress responses, though further large-scale, athlete-focused studies are essential.
4.4. Broader Muscle and Soft Tissue Injuries
ACTN3 encodes α-actinin-3, a structural protein expressed almost exclusively in fast-twitch muscle fibres, which underpin explosive strength and high-velocity movements in sport. A common polymorphism, R577X, determines whether the protein is present (R allele) or absent (X allele). Individuals with the XX genotype lack functional α-actinin-3, leading to shifts in muscle fibre composition and function [31]. Across multiple studies, the XX genotype has been consistently linked to a greater incidence, odds, and severity of non-contact muscle and soft tissue injuries. Physiologically, α-actinin-3 deficiency reduces peak power output and alters fatigue resistance in fast-twitch fibres, diminishing their ability to withstand repetitive high-intensity loading. This may also affect tendons and ligaments [31]. When muscle function is compromised, surrounding connective tissues are exposed to greater mechanical strain, as they must compensate for reduced force absorption and stability. Over time, this imbalance can elevate the risk of secondary injuries such as tendinopathy, ligament sprains, or even ruptures, particularly in sports demanding rapid and powerful movements [59].
4.5. Limitations and Future Directions
Although the findings of this review seem promising, the research is still limited and heterogeneous. Most studies focused on small samples; therefore, they are underpowered, and utilised mostly Caucasian/European samples, limiting the possibility for generalisation to larger and more diverse populations.
The current review is also limited in its scope, as there are only a limited number of studies from different geographical areas, most of which had none or only single studies with small samples, limiting our ability to carry out a review/analysis at different levels to assess how genetic/allele frequency variations may affect the risks/injuries.
There is a need for larger, multi-ancestry, multi-country/continental population-based comprehensive studies on different sport modalities and injuries to further this field and use genomic analyses for personalised therapies/exercise prescription.
A meta-analysis was not undertaken due to the high degree of heterogeneity across the included studies, encompassing substantial variation in population characteristics, methodological approaches, genetic markers assessed, and the injury outcomes used to define musculoskeletal phenotypes. Existing meta-analyses of specific genes and aligned variants, such as COL5A1, demonstrate that when study designs, phenotype definitions, and analytical models are sufficiently standardised, robust pooled estimates can be generated [58,60]. These findings indicate that, with greater methodological alignment in future research, the contribution of these variants to musculoskeletal injury risk will be more precisely characterised.
Evidence for certain genes, such as GDF polymorphisms in relation to stress fractures in athletes, was relatively limited. While GDF5 is widely recognised for its involvement in joint development, cartilage maintenance, and its association with osteoarthritis [61], research directly linking its variants to stress fractures in athletic populations is limited.
Additionally, most research has focused on single-nucleotide polymorphisms, whereas musculoskeletal injury risk is likely to be polygenic, requiring broader approaches such as genome-wide association studies (GWAS) and polygenic risk scores.
Other gaps include limited exploration of gene–gene and gene–environment interactions and insufficient attention to sex-specific effects. Lastly, while associations have been documented in the literature, functional validations are warranted to provide evidence for an association between genetic variants and biological processes relevant to musculoskeletal injury, such as collagen remodelling after running a marathon, muscle repair, and regulating inflammation. Advancing the evidence in the literature will be valuable in meaningfully interpreting the state of the science as we apply what we know about genetic studies, interventions, and practice, continuing to develop more personalised targeted methods for injury prevention and management.
4.6. Clinical Implications and Future Directions
Genetic profiles could complement screening to personalise training load management, neuromuscular conditioning, and recovery protocols. For example, athletes with COL5A1 risk variants for Achilles tendinopathy may benefit from targeted tendon loading and gradual progression; female athletes with COL1A1 variants linked to fatigue fractures may require proactive bone health monitoring, nutrition optimisation, and workload periodisation. Translation must proceed cautiously, given the current evidence limits and ethical considerations.
Importantly, a further methodological challenge lies in quantifying the relative influence—or practical “weight”—of individual genetic variants within the inherently polygenic and multifactorial landscape of sports injuries. Emerging work using cumulative or total genotype scoring illustrates this complexity. Studies demonstrate that injury risk reflects the combined effect of multiple loci rather than the action of single variants in isolation, highlighting the limitations of monogenic interpretations and underscoring the need for integrated polygenic models in future applied practice [10,62].
As research becomes increasingly standardised in phenotype definitions, methodological quality, and analytical frameworks, these polygenic approaches—alongside high-quality meta-analyses on specific markers such as COL5A1 [58,60]—are likely to refine our ability to interpret genetic susceptibility and translate it into meaningful, evidence-based athletic management strategies.
5. Conclusions
Variants in COL1A1, COL5A1, ACTN3, MMP3, and GDF5 are implicated in susceptibility to ACL rupture, Achilles tendinopathy, and stress fractures. While genetic profiling alone is not a definitive predictor, integrating genomic insights with biomechanics, training data, and clinical monitoring may enhance personalised prevention and rehabilitation. Larger, diverse cohorts and mechanistic studies are essential before routine implementation.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hoenig T. Hollander K. Popp K.L. Fredericson M. Kraus E.A. Warden S.J. Tenforde A.S. International Delphi consensus on bone stress injuries in athletes Br. J. Sports Med.202559789010.1136/bjsports-2024-10861639638438 · doi ↗ · pubmed ↗
- 2Mansfield K. Dopke K. Koroneos Z. Bonaddio V. Adeyemo A. Aynardi M. Achilles tendon ruptures and repair in athletes—A review of sports-related Achilles injuries and return to play Curr. Rev. Musculoskelet. Med.20221535336110.1007/s 12178-022-09774-335804260 PMC 9463425 · doi ↗ · pubmed ↗
- 3Filbay S.R. Skou S.T. Bullock G.S. Le C.Y. Räisänen A.M. Toomey C. Ezzat A.M. Hayden A. Culvenor A.G. Whittaker J.L. Long-term quality of life, work limitation, physical activity, economic cost and disease burden following ACL and meniscal injury: A systematic review and meta-analysis for the OPTIKNEE consensus Br. J. Sports Med.2022561465147410.1136/bjsports-2022-10562636171078 · doi ↗ · pubmed ↗
- 4Saragiotto B.T. Di Pierro C. Lopes A.D. Risk factors and injury prevention in elite athletes: A descriptive study of the opinions of physical therapists, doctors and trainers Braz. J. Phys. Ther.20141813714310.1590/S 1413-3555201200500014724845023 PMC 4183252 · doi ↗ · pubmed ↗
- 5Aicale R. Tarantino D. Maffulli N. Overuse injuries in sport: A comprehensive overview J. Orthop. Surg. Res.20181330910.1186/s 13018-018-1017-530518382 PMC 6282309 · doi ↗ · pubmed ↗
- 6Taimela S. Kujala U.M. Osterman K. Intrinsic risk factors and athletic injuries Sports Med.1990920521510.2165/00007256-199009040-000022183329 · doi ↗ · pubmed ↗
- 7Bulgay C. Çakır V.O. Ergün M.A. The importance of genetic factors in sports injuries Innovative Research in Sport Sciences DalkılıçM. Duvar Publishing Izmir, Turkey 20233552
- 8Semenova E.A. Hall E.C.R. Ahmetov I.I. Genes and athletic performance: The 2023 update Genes 202314123510.3390/genes 1406123537372415 PMC 10298527 · doi ↗ · pubmed ↗
