STK11 and DNA Repair Gene Mutations Define Hereditary Subset of Middle Eastern Papillary Thyroid Cancer
Rong Bu, Wael Haqawi, Eman A. Abdul Razzaq, Saud Azam, Kaleem Iqbal, Zeeshan Qadri, Sandeep Kumar Parvathareddy, Maha Alrasheed, Khadija Alobaisi, Fouad Al-Dayel, Abdul Khalid Siraj, Khawla S. Al-Kuraya

TL;DR
A study finds that some Middle Eastern papillary thyroid cancer cases are linked to inherited mutations in genes like STK11 and DNA repair genes, suggesting broader genetic screening is needed.
Contribution
The study identifies germline mutations in cancer predisposition genes in non-syndromic PTC patients from Saudi Arabia, revealing under-recognized hereditary risks.
Findings
11 out of 245 Saudi PTC patients had germline pathogenic variants in cancer susceptibility genes.
Mutations in DNA repair genes like FANCA and RAD50 were more common than expected.
A shared STK11 variant was found in two unrelated patients without Peutz–Jeghers syndrome features.
Abstract
Papillary thyroid cancer (PTC) is the most common endocrine malignancy with especially high incidence in Middle Eastern populations. While classical hereditary syndromes explain a minority of cases, the broader germline landscape of non-syndromic PTC remains unclear. Whole-exome sequencing was performed on 245 unselected Saudi PTC patients to identify germline pathogenic or likely pathogenic variants (PVs/LPVs) in cancer predisposition genes. Clinical and molecular characteristics, and family history were integrated to assess phenotypic correlations. Eleven patients (4.5%) harbored germline PVs/LPVs in cancer susceptibility genes including STK11, TP53, BRCA1, BRCA2, FANCA, SLX4, RAD50, MSH6, POLD1 and NF1. Four patients (36.4%) carried PVs/LPVs in canonical FA pathway genes; this increased to five patients (45.5%) when RAD50 was included. Two unrelated patients harbored the same STK11…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsThyroid Cancer Diagnosis and Treatment · BRCA gene mutations in cancer · Genetic factors in colorectal cancer
1. Introduction
Papillary thyroid cancer (PTC) is the most common histologic subtype of thyroid cancer, comprising approximately 80–90% of all cases globally [1]. Its incidence has been rising steadily over the past two decades [2], with particularly high rates observed in Middle Eastern countries such as Saudi Arabia where thyroid cancer now ranks the second most common cancer among women [3,4].
While the global increase in PTC incidence is partly attributed to advances in imaging and diagnostic sensitivity, emerging evidence suggests that underlying biological and genetic factors also contribute to its rising burden [5,6]. In this context, understanding the contribution of inherited genetic factors has become increasingly important.
Although most cases of PTC are sporadic, inherited predisposition is estimated to account for approximately 5–10% of non-medullary thyroid cancer (NMTC) [7,8,9,10]. These hereditary cases may occur either as part of well-defined cancer syndromes or in non-syndromic familial settings.
Syndromic diseases such as Cowden syndrome (linked to PTEN) [11,12], DICER syndrome, familial adenomatous polyposis (APC) [13] and Li–Fraumeni syndrome (TP53) [14] have been well characterized. However, the genetic basis of non-syndromic PTC remains poorly defined with no single high-penetrance gene identified as a dominant driver.
Furthermore, most prior studies have been limited by small sample sizes, targeted gene panels, or selection biases toward patients with strong family histories or syndromic features. Consequently, the true germline contribution to non-syndromic PTC, particularly in underrepresented populations, remains uncertain.
Recent efforts have begun to explore the broader landscape of germline cancer predisposition in PTC using next generation sequencing platforms [15,16].
These studies suggest that pathogenic or likely pathogenic variants (PVs/LPVs) in genes related to DNA repair, tumor suppression and cell cycle regulation may contribute to thyroid tumorigenesis even in the absence of classical syndromic phenotype [17,18,19]. However, such data remain sparse in Middle Eastern populations, where population-specific founder effects may further shape the germline mutational spectrum [20].
To address these gaps, and provide a more comprehensive assessment of hereditary susceptibility, we conducted germline whole-exome sequencing on a cohort of 245 unselected patients with PTC from Saudi Arabia. Our objective was to identify clinically relevant germline PVs/LPVs across the full spectrum of established cancer susceptibility genes, regardless of syndromic features or family history.
By integrating this unbiased genetic approach with clinical and demographic data, we aim to expand the understanding of hereditary contributions to PTC and inform future strategies for genetic screening and risk assessment, particularly in underrepresented populations.
2. Results
2.1. Clinico-Pathological Characteristics of the Study Cohort
The median age in this group was 39.2 years with patients ranging from 10 to 83 years, and the male to female ratio was 1 to 3.6. Most tumors were of the classical type making up 50.2% (123/245). Multifocal tumors were seen in 47.8% (117/245) of patients. Extrathyroidal extension was found in 41.6% (102 out of 245), while regional lymph node metastasis was noted in 38.4% (94/245) of cases. At the time of diagnosis, distant metastasis was identified in 6.5% (16/245) (Table 1).
2.2. Identification of PVs/LPVs in the Cohort
In the entire cohort, 12 different types of germline PVs/LPVs were identified in cancer susceptibility genes among 11 cases (4.5%), including STK11, TP53, BRCA1, BRCA2, FANCA, SLX4, RAD50, MSH6, POLD1 and NF1 (Table 2). This corresponds to 13 PV/LPVs occurrences across 11 cases because the STK11 variant occurred in two cases and one case carried two SLX4 variants. All detected variants were heterozygous. Four patients (36.4%) carried PVs/LPVs in canonical FA pathway genes; this increased to five patients (45.5%) when RAD50 was included. Eight patients (72.7%) harbored ten PVs/LPVs in eight DNA repair genes, defined here as genes involved in DNA repair and genome integrity, including FA/homologous recombination/mismatch repair/double-strand break repair pathways and TP53-mediated DNA damage response, as detailed in Table 2. All identified variants have been previously reported in public databases such as ClinVar and/or in the published literature. Variant classifications were independently evaluated according to American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP) guidelines and were consistent with available database annotations at the time of analysis. Clinicopathological characteristics of PTC patients with PVs/LPVs are included in Table 3.
One recurrent missense germline LPV (p.R304Q) of STK11 was detected in two cases, accounting for 18.2% of all mutant cases and 0.8% of all sequenced cases. A loss of heterozygosity (LOH) analysis on sequencing data of samples derived from tumor tissue of these two cases were performed; however, no LOH/second hit was detected in either tumor. One of these two cases had a positive family history of thyroid cancer (Table 3).
A haplotype analysis was carried out for two patients who had the recurrent variant and sufficient DNA sample using the PHASE version 2.1.1 tool. The analysis showed that the two individuals carrying the mutation c.911G>A; p.R304Q in STK11 gene shared the same haplotype in a very short region of ~0.64 MB, suggesting that this recurrent mutation has insufficient evidence to be a potential founder.
3. Discussion
The germline genetic landscape of PTC, the predominant form of Differentiated Thyroid Cancer (DTC), remains insufficiently explored, particularly in non-syndromic, unselected patients.
Although syndromic forms of NMTC such as Cowden syndrome, FAP, DICER syndrome and Li–Fraumeni syndrome have been well described, the genetic basis of non-syndromic PTC remains largely undefined especially in the Middle Eastern population.
Our study addresses this gap by applying WES analysis to an unselected cohort of 245 Saudi PTC patients, systematically screening across known cancer susceptibility genes without preselection based on clinical features.
PVs/LPVs were identified in 11/245 (4.5%) of patients. Analysis reveals enrichment within the PV/LPV-positive subset of DNA repair genes alterations, particularly those associated with the Fanconi Anemia (FA) and double-strand break (DSB) repair pathways. Our findings suggest STK11 as a potential candidate susceptibility gene in non-syndromic papillary thyroid carcinoma; however, these observations are hypothesis-generating and warrant validation in larger cohorts and functional studies.
Among 11 patients harboring germline PVs/LPVs, four (36.4%) were involved in canonical FA pathway genes, including SLX4 (FANP), BRCA1 (FANCS), BRCA2 (FANCD1), and FANCA. When RAD50, a gene functionally linked to the FA network via its role in homologous recombination [21] is included, the proportion rises to five cases (45.5%). These findings suggest the disruption of the FA pathway may represent a previously underappreciated mechanism of genetic susceptibility in PTC.
While prior pan-cancer germline analysis, such as the study by Zhang et al. [22], identified occasional FA pathway mutations in pediatric malignancies, these genes have not been specifically implicated in thyroid cancer. Similarly, SLX4 mutations have been described in FA subtype P [23] and in rare cases of breast and ovarian cancer, but not previously in PTC [24].
The recurrence of SLX4 mutations in two unrelated patients suggests potential allelic significance and expands the candidate gene landscape for hereditary thyroid cancer.
Although the majority of patients in our cohort did not have a documented family history of PTC, germline PVs/LPVs were identified in 4.5% of the cases.
The detection of these variants in largely non-familial presentations underscores the limitations of relying solely on family history for hereditary risk assessments and suggests that a meaningful proportion of germline predisposition may be attributable to de novo or low-penetrance variants.
Our findings demonstrating a substantial fraction of PVs/LPVs in DNA repair genes such as FANCA and RAD50, are consistent with emerging evidence linking impaired DNA repair capacity to thyroid carcinogenesis. Qin et al. 2021 [25] reported that patients with papillary thyroid microcarcinoma (PTMC) and reduced double-strand break (DSB) repair capacity had significantly increased risk of developing thyroid cancer, particularly following radiation exposure. Notably, FANCA and RAD50 were among the most frequently mutated genes in their cohort, mirroring our germline findings. Together, these observations reinforce the biological plausibility that genomic instability due to DNA repair pathways may contribute to PTC development, even in the absence of classical syndromic features.
Equally noteworthy is our identification of germline STK11 mutations in two of 245 PTC cases (0.8%), both carrying the same variant (c.911G>A; p.R304Q). Neither patient exhibited classical features of Peutz–Jeghers syndrome (PTS). One patient had a family history of T-cell lymphoma and family history of PTC (grandmother). Both variants were non-truncating and lacked detectable somatic second hits or loss of heterozygosity (LOH).
A haplotype analysis was performed for the two unrelated patients carrying the recurrent STK11 p.R304Q germline variant to investigate a possible founder effect. The analysis revealed a shared haplotype block of approximately 0.64 Mb, which is relatively small. Therefore, there is insufficient evidence to establish a potential founder effect for this mutation.
The co-occurrence of thyroid cancer, T-cell lymphoma and family history in the absence of PJS features suggests a non-syndromic, atypical/non-classical STK11-associated presentation.
Similarly, a TP53 hotspot mutation (p.R175H) was identified in a PTC patient with personal cancer history of early onset stomach cancer, a malignancy with the extended Li–Fraumeni syndrome spectrum. Although patients did not meet full clinical criteria for the syndrome, this finding suggests an atypical expression of TP53-related cancer predisposition and underscores the phenotype variability associated with germline TP53 mutations [26].
Additionally, the BRCA mutation identified in our cohort is a known founder variant in Saudi population [27,28]. Its presence in patients with non-syndromic PTC reinforces the hypothesis that population-specific founder mutations may extend their oncogenic impact across multiple organ sites. These observations further validate the importance of incorporating founder effect data into germline testing strategies in underrepresented populations.
Interestingly, one patient in our cohort harbored concurrent PV/LPV in SLX4 and BRCA2, two critical components of the homologous recombination and FA repair pathway. This co-mutation may represent compounding defects in DNA repair fidelity, potentially resulting in heightened genomic instability and elevated cancer susceptibility. The presence of dual germline hits despite the absence of overt syndrome features emphasizes the need to consider synergistic effects within the same molecular pathway when evaluating hereditary risk and highlights the relevance of pathway-level analyses beyond gene screening in future precision oncology approaches.
Collectively, Our findings reveal that a substantial fraction of germline PV/LPV-positive PTC cases are attributable to mutations in DNA repair genes, particularly within FA and DSB repair pathways. This contrasts with heterogeneous results of previous studies, where no dominant high-penetrance gene has been identified. Although FNMTC accounts for approximately 5% of NMTC cases, our unbiased exome-wide screening uncovered a substantial fraction of PV/LPVs, suggesting gene-based testing may uncover hereditary predisposition even without overt familial clustering.
Beyond FA and DSB repair, the identification of germline MSH6 and POLD1 mutations underscores a broader role of mismatch repair and replication fidelity defects in PTC susceptibility. Although infrequent, these mutations are clinically actionable and support consideration of broader genetic strategies particularly in selected patient populations.
This study is primarily descriptive and does not include population-matched controls or enrichment/burden analyses; therefore, the observed PV/LPV patterns should be interpreted as hypothesis-generating. A minority of germline DNA samples were derived from histologically normal FFPE tissue (19 cases; 1 PV/LPV-positive), which may introduce FFPE-associated technical variability; however, histologically normal tissue was independently reviewed and marked by two pathologists with tumor areas strictly excluded. Furthermore, LOH analysis was feasible only in a limited number of STK11 carriers with available paired tumor material, which restricts interpretability and generalizability; these findings are therefore descriptive. The absence of orthogonal validation is acknowledged as a limitation. The haplotype analysis is exploratory due to the limited number of carriers and should be interpreted cautiously. Formal PCA-based ancestry inference and computational contamination estimation were not performed, as the cohort is clinically homogeneous and the study is descriptive without population-comparative analyses. Contamination control relied on histologic review of normal tissue and standard germline DNA extraction procedures. These aspects are acknowledged as limitations.
In summary, our study highlights the identification of germline PVs/LPVs particularly in STK11, TP53, SLX4, FANCA, BRCA1/2 and RAD50 defined previously under recognized hereditary subset of PTC. These findings support incorporating DNA repair, founder genes into germline testing strategies, and underscore the need for broader population specific approaches to risk stratification and surveillance.
Future research should aim to validate these observations in larger cohorts across different ethnic groups to assess penetrance and improve early detection and surveillance in genetically predisposed individuals
4. Materials and Methods
4.1. Study Cohort
Clinical samples from 245 patients diagnosed with PTC between 1990 and 2020 at King Faisal Specialist Hospital and Research Center (KFSHRC) were included. Clinicopathological details were collected from medical records and presented in Table 1. All 245 diagnosed PTC cases included in this cohort underwent germline whole-exome sequencing and passed basic quality control, with no samples excluded after QC. A study flow diagram summarizing the 245 case cohort has been presented as Supplementary Figure S1.
This study received ethical approval from the Institutional Review Board and the Research Advisory Council (RAC) of KFSHRC (approval number: 2110 031; approval date: 6 February 2012). Written informed consent was obtained from all participants prior to inclusion in the study.
4.2. DNA Extraction and Library Preparation
Genomic DNA extraction was performed using QIAGEN Gentra kit (Minneapolis, MN, USA) as described previously [29]. Germline DNA was primarily extracted from peripheral blood (n = 226). Normal formalin-fixed paraffin-embedded (FFPE) tissue was used in a minority of cases (n = 19). Among the 11 PV/LPV-positive patients, germline DNA was derived from blood in 10 cases and from normal FFPE tissue in one case. For FFPE-derived germline samples, histologically normal tissue was independently reviewed and marked by two pathologists prior to DNA extraction, with tumor areas strictly excluded. Samples that met quality control (QC) standards proceeded to library construction.
Standard exome capture libraries were generated using the Agilent SureSelectXT Target Enrichment kit, Santa Clara, CA, USA. The library preparation process involved random fragmentation of the DNA, followed by 5′ and 3′ adapter ligation. The adapter-ligated fragments were then amplified via PCR. Subsequently, the prepared library was loaded onto a flow cell where fragments were captured by surface-bound oligonucleotides complementary to the library adapters. Each fragment was then amplified into a distinct clonal cluster through bridge amplification, preparing the templates for sequencing.
4.3. Whole-Exome Sequencing
Sequencing was performed on the Illumina NovaSeq 6000 platform, San Diego, CA, USA. Raw image data was processed into base calls, generating binary base call (BCL) files. These BCL files were subsequently de-multiplexed and converted to the FASTQ format using Illumina’s bcl2fastq software (v2.20.0). The quality of the sequencing reads in the FASTQ files was assessed using FastQC (v0.11.9).
4.4. Data Analysis and Variant Calling
Sequencing reads were aligned to the human reference genome (hg19) using the Burrows–Wheeler Aligner (BWA, v0.7.15). Alignment to hg19 was performed to maintain compatibility with the institution’s validated analysis pipeline and legacy annotation resources used for this cohort. Sequencing yielded a median read depth of 218× (range: 93–1008×) per sample. Post-alignment processing, including local realignment and marking of PCR duplicates, was performed using Picard tools (v1.119). Base quality score recalibration was conducted with the Genome Analysis Toolkit (GATK, v4.0.12.0).
Variant calling was performed on each sample using the HaplotypeCaller tool within GATK package (v4.0.12.0). The identified variants were subsequently annotated using the ANNOVAR tool (available at: http://annovar.openbioinformatics.org/; accessed on 12 January 2024) [30]. Variant annotations were gathered from multiple publicly available genomic databases, including dbSNP138, the 1000 Genomes Project, ESP6500, the Exome Aggregation Consortium (ExAC), ClinVar, and other related resources. Germline variants were filtered using a minimum read depth of ≥20, requiring at least five supporting alternate reads. Heterozygous variants were accepted only if the variant allele fraction (VAF) fell between 0.30 and 0.85. All reported PV/LPVs were manually reviewed in IGV to confirm read alignment and exclude sequencing artifacts, such as strand bias or localization at the ends of reads. Population frequency annotation and variant validation were performed using gnomAD v2.1 (hg19 build) along with ExAC, dbSNP build 138, and ClinVar where applicable. Furthermore, variants with more than two reads exhibiting a mapping quality of zero were excluded as false positives. The classification of variants as pathogenic or likely pathogenic followed the established guidelines set forth by the American College of Medical Genetics and Genomics and the Association of Molecular Pathology (ACMG/AMP) [31]. Orthogonal validation by Sanger sequencing was not performed. All reported PV/LPVs were supported by adequate sequencing depth and manual IGV review.
4.5. LOH Analysis
Loss of heterozygosity (LOH) of the STK11 germline pathogenic variants was evaluated using paired tumor–normal DNA sequencing data. Paired tumor–normal WES data was available for n = 2 STK11 carriers (one tumor per carrier), utilizing a minimum threshold of five alternative reads and a regional sequencing depth of ≥20. LOH at the variant locus was defined as a shift in allelic status from heterozygous in the germline sample to either homozygous or hemizygous in the matched tumor sample, based on comparative variant calling between the paired datasets.
4.6. Haplotype Analysis
A high-throughput SNP genotyping array (Illumina Infinium) was employed for genotyping, utilizing a custom design panel of 778,783 SNPs, according to the manufacturer’s instructions (Illumina Inc., San Diego, CA, USA). Genotype calls and normalized signal intensities were determined with the aid of the Illumina Bead Array Files Python library (available at: https://github.com/Illumina/BeadArrayFiles; accessed on 2 June 2024). The quality assessment included monitoring p10 GC scores and sample call rates. The resulting filtered genotype data for all analyzed samples and probes were compiled into a text file. Additionally, SNP data for 100 control subjects were sourced from our in-house database.
We performed haplotype reconstruction for the two test samples alongside the 100 control individuals using tool PHASE (version 2.1.1) [32,33]. Input for the algorithm consisted of variant and nucleotide positions, along with the genotypes for all subjects at those loci. The analysis was run with 100 iterations, a thinning interval of one, and a burn-in of 100. Haplotype analysis was performed as an exploratory assessment to evaluate whether the recurrent STK11 variant may share a common haplotype background.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Lloyd R.V. Buehler D. Khanafshar E. Papillary thyroid carcinoma variants Head. Neck Pathol.20115515610.1007/s 12105-010-0236-921221869 PMC 3037461 · doi ↗ · pubmed ↗
- 2Pellegriti G. Frasca F. Regalbuto C. Squatrito S. Vigneri R. Worldwide Increasing Incidence of Thyroid Cancer: Update on Epidemiology and Risk Factors J. Cancer Epidemiol.2013201396521210.1155/2013/96521223737785 PMC 3664492 · doi ↗ · pubmed ↗
- 3Hussain F. Iqbal S. Mehmood A. Bazarbashi S. El Hassan T. Chaudhri N. Incidence of thyroid cancer in the Kingdom of Saudi Arabia, 2000–2010 Hematol. Oncol. Stem Cell Ther.20136586410.1016/j.hemonc.2013.05.00423756719 · doi ↗ · pubmed ↗
- 4Qusty N.F. Albarakati A.J.A. Almasary M. Alsalamah S. Alharbi L. Alharthi A. Al Sulaiman I.N. Baokbah T.A.S. Taha M.Sr. Thyroid Cancer Knowledge and Awareness in Saudi Arabia: A Cross-Sectional Study Cureus 202315 e 4788810.7759/cureus.4788838034267 PMC 10681848 · doi ↗ · pubmed ↗
- 5Rangel-Pozzo A. Sisdelli L. Cordioli M.I.V. Vaisman F. Caria P. Mai S. Cerutti J.M. Genetic Landscape of Papillary Thyroid Carcinoma and Nuclear Architecture: An Overview Comparing Pediatric and Adult Populations Cancers 202012314610.3390/cancers 1211314633120984 PMC 7693829 · doi ↗ · pubmed ↗
- 6Mussazhanova Z. Rogounovitch T.I. Saenko V.A. Krykpayeva A. Espenbetova M. Azizov B. Kondo H. Matsuda K. Kalmatayeva Z. Issayeva R. The Contribution of Genetic Variants to the Risk of Papillary Thyroid Carcinoma in the Kazakh Population: Study of Common Single Nucleotide Polymorphisms and Their Clinicopathological Correlations Front. Endocrinol.20211154350010.3389/fendo.2020.54350033551988 PMC 7862756 · doi ↗ · pubmed ↗
- 7Sippel R.S. Caron N.R. Clark O.H. An evidence-based approach to familial nonmedullary thyroid cancer: Screening, clinical management, and follow-up World J. Surg.20073192493310.1007/s 00268-006-0847-117429563 · doi ↗ · pubmed ↗
- 8Charkes N.D. On the prevalence of familial nonmedullary thyroid cancer in multiply affected kindreds Thyroid 20061618118610.1089/thy.2006.16.18116613533 · doi ↗ · pubmed ↗
