Prevalence of Ten Gene Variants Involved in Muscular Phenotypes in a Mexican Mestizo Population
Luz Berenice López-Hernández, Guillermina Avila-Ramírez, Ariadna Del Villar-Morales, Mónica Alejandra Anaya-Segura, Luis Angel Montes-Almanza, Froylan Arturo García-Martínez, Antonio Miranda-Duarte, Carlos Antonio Sosa-Flores, Martha Eunice Rodríguez-Arellano

TL;DR
This study examines the frequency of ten gene variants related to muscle traits and disease risk in a Mexican Mestizo population and compares them to global data.
Contribution
The study provides new insights into the genetic makeup of Mexican Mestizos related to athletic performance and chronic diseases.
Findings
Genotype frequencies mostly fit Hardy–Weinberg equilibrium except for MSTN rs1805086 and FST rs1423560.
Significant differences in gene variant frequencies were found compared to 26 populations in the 1000 Genomes Project.
The findings may help future studies on genetic associations with athletic performance and chronic diseases in Mexicans.
Abstract
Several reports have provided evidence that there are genetic variants of genes such as MSTN, BDRKB2, ACTN3 and ADRB2 that are involved in a better response to adaptation during resistance or strength training, while other genes such as GRB14, AGT and END1 are reported to be associated with the risk of suffering from some diseases such as diabetes, hypertension or obesity. A cross-sectional study from a Mexican Mestizo population was performed to estimate the frequency of 10 gene variants in 8 genes involved in athletic performance or chronic degenerative diseases, MSTN (rs1805085, rs1805086), BDKRB2 (rs1799722), FST (rs1423560), ACTN3 (rs1815739), ADRB2 (rs1042713, rs1042714), GRB14 (rs8192673), AGT (rs699) and EDN1 (rs5370), and to compare frequencies from 26 populations reported in the Database of 1000 Genomes project. Genotype frequencies fitted the Hardy–Weinberg equilibrium,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetics and Physical Performance · Muscle Physiology and Disorders · Nutrition, Genetics, and Disease
1. Introduction
Genetic variations significantly impact human muscle structure and function. These differences can lead to distinct anatomical, metabolic, functional, and behavioral characteristics in individuals. Some individuals may possess traits that make them particularly adept at certain physical activities or sports. In contrast, others might be more prone to chronic degenerative diseases like hypertension, diabetes and obesity (Table 1). Understanding these genetic influences can provide valuable insights into human health and performance [1,2]. The diversity observed in the characteristics of skeletal muscle among individuals can be traced back to a combination of genetic factors, environmental influences, or an interplay of both. The role of genetics in determining muscle strength has been hinted at in studies involving twins, which have shown heritability estimates for muscle strength traits to be between 30% and 85%. These estimates, however, can vary based on the specific conditions under which muscle strength is measured. These conditions could include factors such as the limb involved, the angle of contraction, the speed of contraction, and the type of contraction. Despite these findings, the most compelling evidence for a genetic influence on muscle function has been provided by genetic association studies. These studies have shown that variations within an individual’s genome can significantly impact muscle function. These variations can manifest in several ways. They could be as subtle as changes in a single base within the DNA sequence or could involve small deletions and insertions in the sequence. More significant structural variations are also possible, such as alterations in the copy number (changes affecting 50 or more base pairs) [3]. Single nucleotide polymorphisms (SNPs) are the most prevalent and the most thoroughly studied. A single nucleotide variant has the capacity to introduce a stop codon, create a different protein, or even disrupt the secondary structure of the DNA molecule. These intricacies in DNA and the dynamic nature of the genome have been thoroughly explored in previous studies [4,5,6,7].
A recent review of the effects of genetic variations on endurance performance, muscle strength, and susceptibility to injury in competitive sports found that certain genetic variants are overrepresented in elite athletes. One such variant is the rs1464430 (c.275124 A > C) variant of the Insulin Like Growth Factor 1 Receptor gene (IGF1R), which is a critical regulator of muscle growth and repair. The IGF-1 signaling pathway is activated when IGF-1 binds to the IGF1R on the surface of muscle cells. This activates a cascade of signaling events that leads to increased protein synthesis and muscle growth. The rs1464430 (c.275124 A > C) variant of the IGF1R gene is associated with increased IGF1R expression. This means that people with this variant have more IGF1Rs on their muscle cells, which makes their cells more sensitive to IGF-1. This increased sensitivity to IGF-1 can lead to increased muscle growth and endurance. Studies have shown that the rs1464430 (c.275124 A > C) variant is more common in elite endurance athletes than in the general population. This suggests that this variant may play a role in determining who is well-suited for endurance sports. However, it is important to note that genetic variations are just one factor that influences athletic performance. Other factors, such as training, nutrition, and recovery, also play an important role [8].
In addition, the Alpha-actinin-3 gene (ACTN3), which is one of the genes that are linked to improved endurance performance, may also be connected to a higher risk of injuries. For instance, the TT genotype (XX variant) of the rs1815739 polymorphism (c.1729C > T) on the ACTN3 gene has been proven to enhance endurance exercise performance. However, other studies have indicated that this specific genotype is also associated with a higher likelihood of musculoskeletal tissue injury and exercise-induced muscular damage [9].
Hence, phenotypic variance is assumed to be influenced by genetic variations, which are known to differ across various ethnicities and population origins. Consequently, acquiring an understanding of the prevalence of these genetic variants within the adaptive and evolutionary context of the study population is critically important for conducting genetic association studies with sufficient statistical power and for obtaining reliable data that truly represent a population [10]. Phenomena such as bottlenecks, admixture, migrations, and adaptation to environmental conditions may influence gene variant frequencies in specific populations. Interestingly, a recent study highlighted ethnic differences in fatty-acid-derived desaturation indices, wherein insulin-resistant Black women exhibited a fatty acid pattern characteristic of higher insulin sensitivity in European populations [11]. Therefore, the polygenic profile associated with the muscle phenotypes of certain population groups emerges as an approach to investigate endurance in elite athletes and performance in professional sports by comparing allele and genotypic frequencies with the non-athletic population [12].
In general, increasing evidence suggests that particular variants in some genes such as rs1805085 and rs1805086 in the Myostatin gene (MSTN), rs1799722 in the Bradykinin Receptor B2 gene (BDKRB2), rs1423560 in the Follistatin gene (FST), rs1042713 and rs1042714 in the Adrenoceptor Beta 2 gene (ADRB2) and rs1815739 in the ACTN3 gene are involved in differential muscle adaptation to strength and endurance training among individuals. Variants such as rs8192673 in the Growth Factor Receptor Bound Protein 14 gene (GRB14), rs699 in the Angiotensinogen gene (AGT) and rs5370 in the Endothelin 1 gene (EDN1), on the other hand, are associated with chronic degenerative diseases like type 2 diabetes (T2D), essential hypertension (HT) and obesity (Table 2). The prevalence of genetic variants affecting muscular and metabolic phenotypes is largely unknown in Mexican Mestizos as a population group.
The Mexican Mestizo population is the largest group of people in present-day Mexico. This group includes individuals whose parents were born in Mexico and who do not belong to a specific indigenous population. Thus, certain factors such as small population size, geographical isolation, and consanguineous marriages within specific indigenous populations may have an impact on biomedical traits [27]. In this study, we conducted a thorough analysis of a series of genetic variants known to influence muscular performance in Mexican Mestizos. This is the first study to examine these genetic variants in this particular population group.
2. Results
This cross-sectional study included a total of 282 Mexican Mestizo individuals, comprising 195 males and 87 females (Table 1). The primary objective of this study was to meticulously analyze the frequency of ten distinct gene variants. These variants are found within eight genes that have been previously associated with athletic performance or chronic degenerative diseases. Additionally, the frequencies were compared to 26 other populations around the world using information sourced from the 1000 Genomes Project [28]. These 26 populations were chosen because of the availability of data to compare the 10 polymorphisms included in this study. Herein, we examined the following gene variants: rs1805085 and rs1805086 of MSTN, rs1799722 of BDKRB2, rs1423560 of FST, rs1815739 of ACTN3, rs1042713 and rs1042714 of ADRB2, rs8192673 of GRB14, rs699 of AGT and rs5370 of EDN1.
No statistically significant sex differences were detected in any of the variables studied, except for alcoholism and smoking (Table 1).
In our study, we sought to understand the dynamics of allele and genotype frequencies within our target population. To achieve this, we employed the Hardy–Weinberg Equilibrium (HWE) as a theoretical framework. The HWE postulates that both allele and genotype frequencies in a population would remain constant from generation to generation in the absence of other evolutionary influences.
Our first step was to estimate the number of homozygous and heterozygous carriers within the population. This was based on the observed allele frequencies. We then conducted a Chi-square test that was used to determine whether there was a significant difference between the observed genotype frequencies in our population and what would be expected under the HWE.
The results indicated that allele and genotype frequencies were generally consistent with HWE (Table 3) for most of the studied variants. However, exceptions were observed in the rs1805086 of MSTN (p = 0.024) and rs1423560 of FST (p < 0.01). These two SNPs exhibited significant differences in comparison to some American populations, as well as to all African populations. For FST rs1423560 and EDN1 rs5370, the most substantial differences were noted in Asian and European populations, while BDKRB2 rs19799722 and ACTN3 rs1925739 displayed differences across African, American, Asian and European populations. Notably, ADRB2 rs1042713 and rs1042714 exhibited more prominent differences with European populations, GRB14 rs8192673 displayed variations in relation to all Asian populations and AGT rs699 demonstrated differences across almost all African, Asian, American and European populations (Supplementary Material).
3. Discussion
The study of genetic variations linked to human muscle structure and function may have diverse implications, particularly in fields like sports medicine. Genetic case–control studies are a powerful tool for identifying genetic variants associated with complex diseases. The basic idea is to compare the allele frequencies of genetic markers between cases and controls. This holds true for both pathological conditions (e.g., diabetes, hypertension, and sarcopenia) and positive traits (e.g., muscle resistance in high-performance athletes) [10]. If a marker is associated with the disease or the feature of interest (characteristics of high-performance athletes), its allele frequency will be higher in cases than in controls. However, it is important to be aware of potential biases that can affect the results of genetic case–control studies. One of these biases is population stratification. If the cases and controls come from different populations with different allele frequencies, this can lead to false-positive associations [29]. Thus, in order to avoid spurious associations, researchers should consider population origins, substructure, admixture, and the statistical power obtained. Statistical power, defined as the probability to reject a null hypothesis (H0) whilst the alternative hypothesis (HA) is correct, is affected by sample size, the prevalence of the disease, the frequency of the genetic variant and the assumed inheritance model, among other factors [30].Therefore, examining genotype and allele frequencies within a specific population may be useful for two main purposes: (1) gaining insights into the genetic background of that specific population, (in this case, Mexican Mestizos, representing most present-day Mexicans) [29] and (2) enhancing the accuracy of case–control association studies. The statistical power of a genetic association study is influenced by the disease (trait) prevalence, allele frequency, risk conferred by the genotype and the sample size and the genetic model selected. These factors will generally increase the statistical power of the study. However, the relationship between these factors is not linear [31] (Supplementary Material).
In addition, another potential bias is Hardy–Weinberg disequilibrium (HWD). HWD occurs when the allele frequencies of a marker do not follow the expected proportions [32]. This can happen for a number of reasons, such as selection, inbreeding, or genotyping errors. Our study identified a notable departure from the HWE for the rs1423560 variant. This specific variant is situated in the 5′ untranslated region (5′ UTR) of the FST gene, in proximity to several transcription binding sites. Given its conservation and location, there is a strong indication that this variant is involved in gene transcription. Indeed, this particular variant was found to influence baseline muscle size and strength among African Americans [16]. It is noteworthy that the rs1423560 variant was described to fit the HWE in two populations of Caucasians from Perth, Western Australia, and from the US [16,33]. But when data from the HapMap project were consulted, the ASW (n = 98) group derived from African ancestry in southwestern USA was not in HWE [34]. Therefore, caution should be taken when performing genetic association studies.
Interestingly, a study by Curiel-Cervantes et al. showed that the MSTN SNP rs1805086 was linked to higher levels of body mass index (BMI) and waist/height ratio (WHtR) in heterozygous subjects, who showed a greater total and central obesity compared to the homozygous subjects for ancestral allele AA (OR BMI > 30 kg/m). Remarkably, the genotype frequency of AG was 5.4%, and the homozygous risk allele GG was absent (n = 1024), and fitted the HWE, which contrasts with the data found by the present study (Table 3) [35]. The discordance found between the study of Curiel-Cervantes et al. [35] and our study is unlikely to be the result of genotyping errors because real-time PCR is accurate for SNP genotyping; nonetheless, it could be a result of the difference in sample size between both studies.
Therefore, the wealth of data presented here enables more precise and larger studies with selected phenotypes regarding exercise adaptation and complex metabolic diseases in Mexican Mestizos. The allele frequencies and Hardy–Weinberg equilibrium (HWE) results presented here can be used by other researchers to design and conduct case–control genetic association studies in Mexican Mestizo populations for traits such as sarcopenia, obesity, muscle size, and strength response to resistance training.
4. Materials and Methods
4.1. Sample Collection and Study Design
A cross-sectional study that included 282 individuals (195 males and 87 females) was performed on the general population of Mexico City. The eligibility criteria for study participation aligned with those used by the blood bank as a screening system for blood donation. These criteria required participants to meet the following conditions: being between 18 and 65 years of age, weighing more than 50 kg, and fasting for at least 8 h prior to participation. Exclusion criteria encompassed the following: a recent history of respiratory symptoms, diarrhea, or dental infections within the past 14 days; recent ingestion of drugs (including antibiotics, antidiarrheal medication, analgesics, etc.) in the five days leading up to the donation; recent endodontic procedures, acupuncture, piercings, or tattoos within the past 12 months; a surgical intervention within the last 6 months; vaccination within the last 30 days; consumption of alcoholic beverages within 72 h prior to donation; and insulin-dependent diabetes. The participants were subsequently asked about their alcohol consumption, smoking habits, family history of cancer, and whether they had been diagnosed with diabetes and/or hypertension (Table 1). The study included participants aged between 18 and 65 years, with an average age of 37.18 years and an average BMI of 27.75 kg/m^2^. Blood samples were obtained from the National Institute of Rehabilitation Luis Guillermo Ibarra Ibarra’s blood bank. The study received approval from Institutional Committees, and informed written consent was obtained from all participants. The participants in this study were identified as Mexican Mestizos, representing the majority of present-day Mexicans [27], based on anthropological criteria [35]. In this context, Mexican Mestizos are defined as individuals born in Mexico who do not associate with or identify as part of any indigenous group.
4.2. DNA Isolation and Genotyping
Genomic DNA was isolated from peripheral blood leukocytes by the DTAB-CTAB method [36]. The description of this method is provided so that other groups find it easier to reproduce the experiment: A blood sample of 5–10 mL was collected in a tube with EDTA (ethylenediamine tetra-acetic acid) as an anticoagulant, it was then centrifuged at 2000 rpm for 15 min to separate the leukocyte package. In one sterile 1.5 mL microcentrifugation tube (Eppendorf), 600 µL of 8% DTAB (dodecyl trimethylammonium bromide) (Sigma, NaCl 1.5 M, EDTA 50 mM, Tris 100 mM pH 8.7) (Saint Louis, MO, USA) was added and in another Eppendorf tube 100 µL of 5% CTAB (cetyltrimethylammonium bromide) (NaCl 0.4 M) was placed. Next, 150 µL of the leukocyte package (from the freshly centrifuged samples) was placed in the 1.5 mL Eppendorf tube containing 8% DTAB, the tube with the blood sample was centrifuged again at 2000 rpm for 15 min, and another 150 µL of the leukocyte package was obtained and deposited into the Eppendorf tube with DTAB from the previous step. The mixture was heated in a dry bath incubator at 65–68 °C for 5 min. After that, 550 µL of 100% Chloroform was added to the same Eppendorf tube and was manually agitated vigorously for 5 min; the mixture was then centrifuged at 13,000 rpm for 10 min. Following centrifugation, two phases were visible—a proteinaceous phase at the bottom of the tube and a transparent aqueous phase on the surface. The upper aqueous phase was poured into the tube containing CTAB, mixed, and 750 µL of sterile water was added. It was gently mixed until the phases were combined and then allowed to rest for 10 min at room temperature. Subsequently, it was centrifuged at 10,000 rpm for 5 min. The supernatant was discarded, retaining the precipitate, to which 200 µL of 1.2 M NaCl was added to resuspended it. Then, 850 µL of 100% cold ethanol was added, mixed, and allowed to precipitate in the cold for 10 min (at −20 °C). It was then centrifuged at 10,000 rpm for 5 min. The supernatant was discarded again and 500 µL of 70% ethanol was added. It was once more centrifuged at 10,000 rpm for 5 min, repeating the process of discarding the supernatant after. The remaining precipitate was dried in a vacuum dryer or at room temperature. Finally, the precipitate was resuspended in sterile water.
Agarose gels and a Nanodrop ND-100 were used to assess DNA quality before real-time PCR. A total of 20 ng of genomic DNA from peripheral blood was used for the PCR. Genotyping was performed with real-time PCR using TaqMan probes (hydrolysis probes) specific for the variants; MSTN (rs1805085, C_9480364_10, and rs1805086, C_282184_30), BDKRB2 (rs1799722, C_1772941_10), FST (rs1423560, C_8884098_10), ACTN3 (rs1815739, C_590093_1_), ADRB2 (rs1042713, C_2084764_20, rs1042714, C_2084765_20), GRB14 (rs8192673, C_1356985_10), AGT (rs699, C_1985481_20) and EDN1 (rs5370, C_598677_1_) of Applied Biosystems, Foster City, CA, USA. PCR reactions were prepared using the standard conditions for 25 μL reactions, and the mixture was homogenized and placed in the dish; subsequently, we added 20 ng of DNA to each well. The reaction dish was sealed with adhesive film and gently centrifuged. The conditions for PCR were 95 °C, 10 min for the activation of the polymerase, 95 °C, 15 s for denaturation and an extension of 60 °C 1 min. The acquisition of data and allelic discrimination was carried out in LightCycler 480 II (Roche Diagnostics GmbH, Rotkreuz, Switzerland).
4.3. Statistical Analysis
Following the genotyping process, we conducted a thorough analysis of all the variants identified. Our primary objective was to determine whether these variants adhered to the Hardy–Weinberg Equilibrium (HWE) within our study population. The Hardy–Weinberg Equilibrium (HWE) is a statistical principle that describes the expected distribution of allele and genotype frequencies in a population under certain assumptions. These assumptions include random mating, no selection, no mutation and no migration. HWE is important for genetic association studies because it provides a baseline for comparing the observed allele and genotype frequencies in a study population to the expected frequencies under HWE. If the observed frequencies differ significantly from the expected frequencies, this suggests that one or more of the HWE assumptions has been violated. This could be due to a number of factors, such as population stratification, selection or genotyping errors. Therefore, we utilized a χ2 test, which is readily available online [37]. Once we had established the HWE status of our variants, we proceeded to compare the genotype and allele frequencies that we had obtained with those from twenty-six other populations. These populations were derived from the 1000 Genomes Project, a comprehensive resource that provides a detailed representation of human genetic variation.
For the statistical analysis, we employed SPSS software v. 18.0 (SPSS Inc., Chicago, IL, USA). This powerful tool allowed us to conduct a rigorous comparison of genotype and allele frequencies between our study population and the populations from the 1000 Genomes Project. We used the χ2 test in conjunction with Fisher’s exact test correction to compare these frequencies. This approach enabled us to identify any significant differences between populations. We set our p-value threshold at <0.05; any value below this threshold was considered indicative of statistically significant differences among populations.
4.4. Limitations of the Study
Although descriptive studies of allele frequencies in specific populations are a valuable tool for understanding the genetic diversity of human populations and identifying genetic variants that may be associated with disease or other traits, our study presented some limitations. These included a small sample size and a scarcity of information on individual characteristics, and we were unable to demonstrate why some of the variants were not in HWE; we limited our contribution to explaining how allele frequencies are important factors that may influence genetic case–control studies.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Lozano R. Gomez-Dantes H. Garrido-Latorre F. Jimenez-Corona A. Campuzano-Rincon J.C. Franco-Marina F. Medina-Mora M.E. Borges G. Naghavi M. Wang H. Burden of disease, injuries, risk factors and challenges for the health system in Mexico Salud Publica Mex.20135558059410.21149/spm.v 55i 6.730424715011 · doi ↗ · pubmed ↗
- 2Costa A.M. Breitenfeld L. Silva A.J. Pereira A. Izquierdo M. Marques M.C. Genetic inheritance effects on endurance and muscle strength: An update Sports Med.20124244945810.2165/11650560-000000000-0000022559317 · doi ↗ · pubmed ↗
- 3Lappalainen T. Scott A.J. Brandt M. Hall I.M. Genomic Analysis in the Age of Human Genome Sequencing Cell 2019177708410.1016/j.cell.2019.02.03230901550 PMC 6532068 · doi ↗ · pubmed ↗
- 4Peng C.K. Buldyrev S.V. Goldberger A.L. Havlin S. Sciortino F. Simons M. Stanley H.E. Long-range correlations in nucleotide sequences Nature 199235616817010.1038/356168 a 01301010 · doi ↗ · pubmed ↗
- 5Lohrer H.D. Tangen U. Investigations into the molecular effects of single nucleotide polymorphism Pathobiol. J. Immunopathol. Mol. Cell. Biol.20006828329010.1159/00005593911493762 · doi ↗ · pubmed ↗
- 6Ikeda Y. Genetic information theory—From the standpoint of molecular biology Nihon Ishikai Zasshi. J. Jpn. Med. Assoc.1967571551756068358 · pubmed ↗
- 7Shastry B.S. SN Ps: Impact on gene function and phenotype Methods Mol. Biol.200957832210.1007/978-1-60327-411-1_119768584 · doi ↗ · pubmed ↗
- 8Appel M. Zentgraf K. Krüger K. Alack K. Effects of Genetic Variation on Endurance Performance, Muscle Strength, and Injury Susceptibility in Sports: A Systematic Review Front. Physiol.20211269441110.3389/fphys.2021.69441134366884 PMC 8334364 · doi ↗ · pubmed ↗
