Determination of carriers of deafness-infertility syndrome in Peru
Luana Carolina Ventura Cuellar, Kevin Bryan Aviles Jascha, Hugo Hernan Abarca-Barriga

TL;DR
This study estimates the number of people in Peru who carry genes linked to deafness-infertility syndrome and finds higher carrier rates in poorer regions.
Contribution
The study provides the first estimate of DIS carrier frequency in Peru and links it to regional poverty levels.
Findings
Approximately 367,364 people in Peru are estimated to be carriers of DIS-related genes.
Around 57,442 people in Peru are estimated to have both deafness and infertility due to DIS.
Higher carrier frequencies were observed in regions with higher poverty rates.
Abstract
The prevalence of deafness-infertility syndrome (DIS) is approximately 1%. Genetic heterogeneity is one cause of homozygous copy number variants (CNVs) involving the CATSPER2 and STRC genes, which are associated with DIS and male infertility. Because the prevalence of DIS in Peru is unknown, we aimed to determine the frequency of carriers of DIS-related genes. In this descriptive crossover study, we evaluated the clinical histories and chromosomal microarray analysis results of patients at the Instituto Nacional de Salud del Niño Breña from 2015 to 2022. All patients with CNVs involving the CATSPER2 and STRC genes were included, and the frequencies of carriers and affected patients were determined using Hardy‒Weinberg equilibrium. Relative frequency differences were calculated using the chi-square test with goodness-of-fit for natural regions and poverty groups in Peru. Of 2,142…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomic variations and chromosomal abnormalities · Telomeres, Telomerase, and Senescence · Hearing, Cochlea, Tinnitus, Genetics
Introduction
Overall, 6 to 8% of the population has a rare disease, and 80% of these diseases have a genetic origin [1]. Similarly, it is estimated that 24% of people are carriers of at least one recessive disease (X-linked or autosomal) [2]. In addition, the frequency of copy number variations (CNVs) in DNA is at least 2% [3]. Futhermore, it is estimated that people have at least one heterozygous variant of an autosomal recessive disease, and on average, they can carry between 2 and 5 variants of autosomal recessive diseases [4]. Hearing loss causes greater learning difficulties, resulting in a higher incidence of poverty and lack of employment in adulthood [5]. Similarly, infertility is a common factor in psychological illnesses that could impact the economic sector [6].
Among these rare diseases, deafness-infertility syndrome(DIS) is characterized by a CNV of chromosome 15q15.3, in loss (homozygous deletion) at the DFNB16 locus, involving the CATSPER2 and STRC genes; which is characterized by asthenotherozoospermia (males), and prelingual sensorineural hearing loss (both sexes) of mild to moderate intensity [7–9]. In China, this syndrome was reported in three consanguineous families [9] and in France, the United States, Israel and the Netherlands, patients were reported in non-consanguineous families [7, 8, 10]. These previous findings suggest the importance of conducting studies in other populations to understand the importance of consanguinity in rare diseases.
Variants in the STRC gene results in syndromic sensorineural hearing loss (SNHL), due to the fact that some variants are associated with the CATSPER gene, while variants in the GJB2 gene are mainly related to nonsyndromic SNHL [11]. It is estimated that 2 out of every 1000 newborns are affected with sensorineural hearing loss (SNHL), 50–70% of these cases are of genetic origin, 75% of which have autosomal recessive inheritance.
Variants in the STRC gene are linked to 1–5% of patients with mild to moderate SNHL [12]. On the other hand, as SNHL is an under-diagnosed condition, a routine genetic test can be implemented as a suggestion to achieve a differential diagnosis in those cases of congenital hearing impairment [13] which allows a new public health criterion for premarital screening to be proposed as a preventative approach.
The CATSPER2 gene, which is involved in “capacitation, hyperactivation, acrosomal reaction and sperm motility” [12, 14], has been linked to male infertility [9]. Therefore, homozygous deletion of CATSPER2 can cause asthenoteratozoospermia [14], resulting in a decreased penetrative ability, impaired hyperactivation and the absence of response to progesterone [7]. Regarding the prevalence of deafness-infertility syndrome, few studies have been performed at the global and national scale [7–9, 12, 14, 15].
Being the aim of the study to determine the frequency of carriers DIS syndrome in Peru, based on the patients attended at the INSN, which is one of the most important national reference centers in pediatric care and the only institution that performs this genetic test. The results of this study could identify the region with the highest frequency of DIS; this knowledge can be applied at the public health level to raise awareness of the necessity of premarital genetic testing.
Methods
Study design
A cross-sectional study was performed in which patients who underwent chromosomal microarray analysis (CMA) at the National Institute of Child Health from 2015 to 2022 were included.
Study population
The study population was defined by the following selection criteria: patients who underwent CMA from 2015 to 2022 with quality indices above those stipulated by Affymetrix ^®^. Patients with medical records that were incompletely digitized in the database and those whose parents or guardians did not allow their participation were excluded from the study.
Sample
The medical records composed of all patients with DIS carriers from 2015 to 2022 found in the database of the Genetics & Inborn Errors of Metabolism (IEM) Service of the Instituto Nacional Del Niño were selected. For sample selection, non-probability convenience sampling was applied, with a statistical power of 80% with 95% CI.
Study variables
For DIS carriers, patients with deletion of CATSPER2 and STRC genes were included [7]. Both sexes were included, aged 0 to 18 years; who underwent the CMA test, with diagnoses such as neurodevelopmental disorders (intellectual disability, autism spectrum disorder, psychomotor developmental delay) and others (language delay, short stature, congenital anomalies, etc.). In addition, information such as parental age, CNV size, and uniparental disomies and parental consanguinity were extracted by homozygosity regions (ROH).
Likewise, patients, parents and grandparents were classified according to dsitricts and regions of origin. Additionally, to analyze the demographic characteristics of the participants, we classified them into poverty groups by considering the region of origin of the patients’ grandparents as a reference to ensure greater representativeness of our study, taking as a reference the technical report of the evolution of monetary poverty in the INEI for the year 2022 [16, 17].
On the other hand, the Hardy Weinberg equilibrium states that allelic and genotypic frequencies must remain constant throughout the generations of a population in order to maintain balance. For this to be fulfilled, the population must be infinite, there must be no individuals from another population, mating must be random, i.e., there must be no natural selection, and there must be no genotypes more effective than others [18].
Bias
Although the ideal approach is to study the general population, the INSN is a national reference institute for performing CMA in patients with diagnoses different from that of interest to our study, where the identification of a DIS carrier is incidental or secondary, which would allow extrapolation of our results to the general population. Furthermore, due to the number of cases found, further extension of the analysis was not possible. On the other hand, as this study is based on medical records (secondary sources), information bias may be present; thus, some data may have recording and digitization errors.
Procedure
CMA was performed according to the GeneChip CytoScan 750 K Array protocol (Affymetrix, USA ^®^) following the manufacturer’s instruction manual. The test includes 550 000 non-polymorphic markers and 200 436 markers with single nucleotide polymorphism. The cells in rows are scanned using the Chromosome Analysis Suite (ChAS) software (Affymetrix, USA ^®^). For gains or losses are due to the compromise of at least 50/25 markers respectively. In ROHs, a length of at least 10 Mpb is impaired [19].
Statistical methods
Descriptive analysis
For quantitative variables, a normal distribution was obtained, to then establish the mean and standard deviation (parents’ age). On the other hand, a non-normal distribution was used to establish median and interquartile range (patient age).
Absolute and relative frequencies were calculated for qualitative variables.
By applying the Hardy-Weinberg law, the possible frequency of carriers in the Peruvian population was estimated assuming the following premises: absence of migration, infinite population, absence of genetic drift, no natural selection, random reproduction and one genotype should not be more effective than another; applying the following formula:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\left( {p + q} \right)^2} = {p^2} + 2pq + {q^2}$$\end{document}where p^2^: homozygous dominant.
2pq: heterozygous.
q^2^: recessive homozygote.
Thus, based on a total number of participants who underwent the CMA in this study, and data from the Peruvian population according to INEI. The formula was applied to estimate healthy, sick and carrier patients, upper and lower limits were estimated with a 95% CI [20, 21].
Bivariate analysis
The chi-square test with goodness of fit was used to verify whether there were differences in the proportions of DIS and CATSPER2 gene deletion carriers according to natural region or origin and poverty group according to the most recent INEI report. Patients were divided into poverty groups: Group 1 corresponded to the regions of Cajamarca; Group 2 corresponded to Amazonas, Ayacucho, Huancavelica, Huánuco, Loreto, Pasco and Puno; Group 3 corresponded to Ancash, Apurimac, Junín, La Libertad, Lima, Moquegua, Piura, Callao, and San Martin; Group 4 corresponded to Arequipa, Cusco, Lambayeque, Metropolitan Lima, Madre de Dios, Tacna, Ucayali; Group 5 corresponded to Ica and Tumbes [16, 17].
Statistical significance was defined as p < 0.05 and a confidence level of 95%. The following statistical programs were used: STATA version 15.0 and Jamovi version 2.3.
Ethical aspects
The present study complied with ethical research considerations: the data were used anonymously, personal information such as email address was not requested, and patient identity was anonymized; therefore, informed consent was not required. The information used in the study was obtained with the consent of the respective authorities of the Instituto Nacional Del Niño (institutional and ethical approvals, number N°237-2022-CIEI-INSN). Likewise, the Ethics Committee of the Universidad Cientifica Del Sur approved this project prior to its implementation (Registration number: PRE-15-2022-00459).
Results
In the present study, out of 2,142 patients who underwent CMA, 35 patients with variants were included; 26 patients had variants involving the STRC and CATSPER2 genes, and 9 patients harbored deletion of the CATSPER2 gene. The median age of the children evaluated was 4 years (RIC 3–6), the average age of the fathers was 39 years and this difference is 5 years greater than the age of the mother. There is a difference in the proportion of gene affectation only by CATSPER2 and those presenting CASTSPER2 and STRC (p = 0,004). At the level of natural region of origin, a clear difference is shown in those who are from the coast in relation to the rest of the natural regions (p < 0,05). The 54.3% of the cases found are female and 68.6% have no consanguinity according to the CMA (ROH < 0,05). Finally, the patients underwent the CMA test whose main diagnosis was neurodevelopmental disorders 71.4% (Table 1).
Table 1. Epidemiological data of children who underwent chromosomal microarray analysis at INSNVariable n % p
Age Children(median/IQR)4(3–6) Mother34.010.1 Father39.112.2 CNV size Minimum coordinate Minimum value43,553,262 Maximum value43,636,120 Maximum coordinate Minimum value43,684,983 Maximum value43,715,339 Sex Male1645.70.612 Female1954.3 Consanguinity (ROH > 0.5) Yes1131.40.028^£^ No2468.6 Gene involvement CATSPER2925.70.004^£^ CATSPER2 y STRC2674.3 Region of origin Region of child Coast2571.40.067 Highlands617.10.062 Rainforest12.90.061 Foreigners38.6- Mother’s region Coast1851.40.005^£^ Highlands1131.40.564 Rainforest38.60.319 Foreigners38.6- Father’s region Coast2365.70.035^£^ Highlands1028.60.411 Rainforest12.90.061 Foreigners12.9- Maternal grandfather’s region Coast1954.30.007^£^ Highlands1337.10.917 Rainforest000.020 Foreigners38.6- Maternal grandmother’s region Coast1748.60.003^£^ Highlands14400.901 Rainforest12.90.061 Foreigners38.6- Paternal grandfather’s region Coast21600.008^£^ Highlands1234.30.602 Rainforest12.90.050 Foreigners12.9- Paternal grandmother’s region Coast21600.008^£^ Highlands1131.40.449 Rainforest25.70.127 Foreigners12.9- Diagnostics neurodevelopmental disorders2571.4 others1028.6Neurodevelopmental disorders: intellectual disability, autism spectrum disorder and psychomotor developmental delay. Others: language delay, short stature, congenital anomalies, etcIQR: interquartile range. SD: standard deviation. CNV: copy number variants*SD and mean were calculated. ^£^p-value according to chi-square with goodness of fit according to the actual proportions of the different regions based on INEI data. Coast: 55.9%, highlands: 29.6% and jungle: 14.5%
Of a total of 26 patients with DIS, it was found that the origin of the patients with DIS according to departments at the national level was Lima 64.0% (n = 14) and Junín 8.0% (n = 2). However, three patients were found to be foreigners and were not included in the analysis. Of a total of 104 maternal and paternal grandparents of the DIS carriers, where 8 subjects of foreign origin were excluded, it was found that the most frequent origins of the grandparents were from Lima 38.5% (n = 37), Junín 12.5% (n = 12) and Cajamarca 9.4% (n = 9). Among the 9 patients with the CATSPER2 deletion, 100% were from Lima. Meanwhile, of a total of 36 maternal and paternal grandparents, it was found that the most frequent origins of the grandparents were from Lima 22.2% (n = 8), Piura 22.2% (n = 8) and Ayacucho 19.4% (n = 7) (Table 2).
Table 2. Districts of origin of patients carrying DIS and maternal and paternal grandparentsDistrictsDistricts of origin of carrier patientsDistricts of origin of maternal and paternal grandparentsDIS%CATSPER2%DIS%CATSPER2%Amazonas00.000.000.000.0Áncash00.000.055.200.0Apurímac00.000.000.000.0Arequipa00.000.044.200.0Ayacucho14.000.066.3719.4Cajamarca14.000.099.4513.9Callao00.000.000.000.0Cusco00.000.022.100.0Huancavelica00.000.044.200.0Huánuco14.000.022.112.8Ica00.000.000.000.0Junín28.000.01212.5411.1La Libertad14.000.066.325.6Lambayeque14.000.011.000.0Lima1464.09100.03738.5822.2Loreto00.000.022.100.0Madre de Dios00.000.000.000.0Moquegua00.000.000.000.0Pasco00.000.000.000.0Piura14.000.044.2822.2Puno00.000.000.000.0San Martín00.000.011.012.8Tacna00.000.000.000.0Tumbes14.000.011.000.0Ucayali00.000.000.000.0Total23100.0910096100.036100.0*Chi-square test with goodness of fit according to the actual proportions of the different districts based on INEI data^£^
Of a total of 104 maternal and paternal grandparents of the DIS carriers, where eight subjects of foreign origin were excluded (n = 96), the highest frequency of DIS carriers according to the poverty group established by the INEI was found in group 1 with 9.4% (n = 9) coming from the department of Cajamarca; being the only group with significant differences (p = 0.020). Regarding the maternal and paternal grandparents of the carriers of the CATSPER2 deletion, out of a total of 36, the highest frequency of grandparents of carriers with respect to the poverty group is found in group 3, which includes Lima; however, only groups 1, 2 and 4 present significant differences with the values expected by the INEI (p < 0,05). In both analyses, the frequency of foreigners was not considered because there is no report of the proportion of foreigners in the Peruvian population (Table 3).
Table 3. Districts groups with poverty groups of grandparentsPoverty groupsFrequency of DIS carriers%pCarrier frequency (deletion in CATSPER2)%*pGroup 1^£^ (4,48%)99,38%0,020513,89%0,006Group 2^£^ (11,64%)1414,58%0,369822,22%0,048Group 3^£^ (32,24%)3536,46%0,3771644,44%0,117Group 4^£^ (47,69%)3738,54%0,073719,44%< 0,001Group 5^£^ (3,95%)11,04%0,14300%0,224TOTAL96100,0%36100,0% p-value according to chi-square with goodness of fit according to the actual proportions of the different poverty groups based on INEI data^£^Group 1: CajamarcaGroup 2: Amazonas, Ayacucho, Huancavelica, Huánuco, Loreto, Pasco, PunoGroup 3: Ancash, Apurimac, Junin, La Libertad, Lima, Moquegua, Piura, Const. Prov. del Callao, San MartinGroup 4: Arequipa, Cusco, Lambayeque, Metropolitan Lima, Madre de Dios, Tacna, UcayaliGroup 5: Ica and Tumbes
Based on the Hardy-Weinberg equilibrium with a total population of 33 millions 396,700 inhabitants, it was found that 36,364 inhabitants would be carriers of DIS, with a frequency of 1.1%. While, the affected persons would range from 1 to 57,442 Peruvian population (CI 95%) (Table 4).
Table 4. Frequency and confidence interval for the DIS in the Peruvian populationDIS n %Frequency according populationConfidence interval at 95%.N°LowerUpperHealthy211998,933 029 33632 860 01533 168 935Carriers231,1367 363227 765536 685Sick0< 0,001< 0,001057 442Total2142100%33 millones 396 mil 700 habitants
By evaluating only the CATSPER2 gene deletion, and applying again the Hardy-Weinberg equilibrium, it was found that 133,587 Peruvian inhabitants would have a heterozygous deletion in Peru, with a frequency of 0.4% (Table 5).
Table 5. Frequency and confidence interval for CATSPER2 in the Peruvian populationCATSPER2 n %Frequency according populationConfidence interval at 95%.N°LowerUpperHealthy213399,633 263 11333 130 86233 332 578Carriers90,4133 5876 412265 838Sick0< 0,001< 0,001057 442Total2142100%33 millones 396 mil 700 habitants
Discussion
In the present study, 26 patients with heterozygous deletions in the CATSPER2 and STRC genes were reported; and based on the Hardy-Weinberg equilibrium, it was estimated that 1.1% of the national population could be carriers (367,363 Peruvian population). In a previous study in United States, a frequency of 1% DIS carriers is proposed, with a prevalence of 1 in 40,000 people [9]; this value is similar to the frequency found in this study. On the other hand, a national study (n = 400) reported 12 children with heterozygous deletions of DIS, where 3 out of every 100 people could carry this condition; however, the data may have been overestimated because the study was performed in a minor population [19].
A systematic review and meta-analysis estimated that 1.36% of a population without hearing problems could be autosomal recessive carriers for STRC variants [22]. In addition, it has been described that non-syndromic hearing loss is more frequently autosomal recessive, where approximately 50% involve the DFNB1 locus, caused by variants in the GJB2 and GJB6 genes [9]. The second most frequent cause of mild to moderate hearing loss belonged to homozygous deletions in STRC [12]. However, no CNVs involving only the STRC gene were found to contrast our findings with data from other investigations.
On the other hand, nine patients with deletions in the CATSPER2 gene were found. Likewise, it was estimated that the possible frequency of carriers according to Hardy-Weinberg’s equilibrium was 0.4% at national level, with a possible affection from 1 to 133,587 Peruvian inhabitants carrying this deletion. Currently, no frequencies have been found for the CATSPER2 gene, and worldwide there are studies where the CATSPER2 deletion is linked to the STRC deletion and the detection is secondary to the analysis of the STRC gene [22, 23]. Similarly, another study indicates that its detection is currently not possible through the application of clinical tests and for this reason it is underdiagnosed as a causal factor of male infertility [14].
One of the limitations of the present study is its cross-sectional nature, which does not allow us to establish a cause-effect relationship of the associated variables. In addition, although it would be ideal to study the general population, the INSN is a national reference institute for the performance of CMA in patients with diagnoses other than the disease of interest in our study; therefore, the identification of a DIS carrier is incidental or secondary, which could allow for an extrapolation of the results to the general population. Furthermore, due to the number of available cases, it was not possible to extend the analysis. In addition, as this study is based on medical records (secondary sources), there may be information bias; therefore, some data may have recording and digitization errors. Finally, we attempted to characterize the poverty levels of the families in the study, although it would have been ideal to obtain information on the origin of the grandparents at the district level for the best approximation of poverty. When applying Hardy-Weinberg equilibrium, the results would be an estimate but would not necessarily reflect the true value of the carrier frequency because the assumptions would not be fully met, such as migratory movements or non-random matching.
Finally, it was observed that the DIS carriers and their grandparents probably come from Lima and Junín. Meanwhile, CATSPER2 deletion carriers and their grandparents come mainly from Lima. Regarding the poverty groups, a higher frequency of DIS carriers was found in those families that came from group 1 and CATSPER2 in those families that came from groups 1 and 2, who are also in a group with a higher incidence of poverty at the national level according to the INEI, and show a p-significance [16, 17]; however, there were no other studies that could provide information on these data.
Conclusions
In this study, we report that the frequency of DIS carriers is similar to that reported in the United States, but there are no concise frequencies in Europe, Asia and Africa. Additionally, in the departments where we found a higher frequency of carriers was in Lima and Junín. Concerning the poverty group, those who belonged to group 1 would have a great possibility of having a higher frequency of DIS carriers.
The findings of this study could be a basis for new studies focusing on the importance of early diagnosis to reduce the impact of DIS on public health. Further research might be required to investigate the etiologies and various treatments in premarital or preconceptional counseling.
Recommedations
Due to the likelihood of finding a higher frequency than expected, it is essential to perform genetic studies such as multiplex PCR or multiplex ligand dependent probe amplification (MLPA) for the diagnosis of this condition. Thus, a prevalence study would allow early and timely management in terms of clinical decision making; with respect to the application in health management, it will allow proposing a new public health criterion starting from a preventive approach to premarital screening. On the other hand, as hearing loss is underdiagnosed, it should be considered as an initial screening for DIS and should be included in the differential diagnosis of people with hearing loss.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Yokota Y, Moteki H, Nishio S ya, Yamaguchi T, Wakui K, Kobayashi Y et al. Frequency and clinical features of hearing loss caused by STRC deletions. Sci Rep. 2019;9(1):4408.10.1038/s 41598-019-40586-7PMC 641631530867468 · doi ↗ · pubmed ↗
- 2Luo T, Chen H, yang, Zou Qxing, Wang T, Cheng Y, min, Wang H et al. feng,. A novel copy number variation in CATSPER 2 causes idiopathic male infertility with normal semen parameters. Hum Reprod. 2019;34(3):414–23.10.1093/humrep/dey 37730629171 · doi ↗ · pubmed ↗
- 3CIFRAS DE POBREZA. 2022 INEI. [cited 2024 Feb 4]. Available from: https://www.inei.gob.pe/media/Menu Recursivo/publicaciones_digitales/Est/pobreza 2022/Pobreza 2022.pdf
- 4PROYECCIONES DE POBLACION TOTAL INEI. [cited 2023 Feb 1]. Available from: https://cdn.www.gob.pe/uploads/document/file/3624028/Per%C 3%BA%3A Proyecciones de Poblaci%C 3%B 3n Total seg%C 3%B An Departamento%2C Provincia y Distrito%2C 2018-2022.pdf?v=1663081335
- 5Han S, Zhang D, Guo Y, Fu Z, Guan G. Prevalence and Characteristics of STRC Gene Mutations (DFNB 16): A Systematic Review and Meta-Analysis. Front Genet. 2021 [cited 2023 Jan 19];12. Available from: https://www.frontiersin.org/articles/10.3389/fgene.2021.70784510.3389/fgene.2021.707845 PMC 849165334621290 · doi ↗ · pubmed ↗
