An Oxford Nanopore Technologies–Based Sequencing Assay for Molecular Diagnosis of Phenylketonuria and Variant Frequencies in a Turkish Cohort
Gülten Tuncel, Mehmet Cihan Balcı, Gökçe Akan, Hasan Hüseyin Kazan, Özge Özgen, Ahmet Çağlar Özketen, Meryem Karaca, Asuman Gedikbaşı, Fatmahan Atalar, Gülden Fatma Gökçay

TL;DR
This study developed a fast and cost-effective genetic test for PKU using Oxford Nanopore sequencing, validated against Illumina, and identified common mutations in a Turkish cohort.
Contribution
A novel ONT-based PAH gene sequencing assay for PKU diagnosis with variant frequency analysis in a Turkish population.
Findings
ONT-based sequencing showed complete agreement with Illumina for PKU variant detection.
The most common variants in the Turkish cohort were c.1066-11G > A (37.5%) and c.782G > A (15%).
ONT sequencing proved reliable and faster for neonatal PKU screening in high-prevalence regions.
Abstract
Background: Phenylketonuria (PKU) is an autosomal recessive metabolic disorder caused by mutations in the PAH gene, resulting in deficient phenylalanine hydroxylase (PAH) enzyme activity and neurotoxic phenylalanine accumulation. Untreated PKU results in progressive neurodegeneration and severe intellectual disability. Neonatal screening has evolved from the Guthrie test to advanced techniques like HPLC, tandem mass spectrometry, and next-generation sequencing (NGS) for molecular confirmation. This study aimed to develop a rapid, scalable PAH genetic assay using Oxford Nanopore Technologies (ONTs) to enhance neonatal screening in high-prevalence regions like Türkiye, through accelerated, cost-effective genetic diagnostics. Methods: An in-house panel was designed, implemented, and benchmarked against results obtained from the Illumina sequencing platform. A cohort of 40 PKU patients,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2- —Istanbul Üniversitesi
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetabolism and Genetic Disorders · Mitochondrial Function and Pathology · Genomics and Rare Diseases
1. Introduction
Phenylketonuria (PKU), also known as phenylalanine hydroxylase (PAH) deficiency (OMIM #261600), is an autosomal recessive disorder characterized by mutations in the PAH gene. The PAH enzyme (EC 1.14.16.1) encoded by PAH gene normally catalyzes the hydroxylation of Phe to tyrosine, a crucial step in the catabolism of dietary proteins and other sources of Phe. Under physiological conditions, PAH enzyme plays an important role in eliminating approximately 75% of the Phe obtained from dietary sources and catabolism of the proteins. The expression of PAH is high in the liver where excess amount of Phe is removed. However, in individuals with PKU, the deficient PAH enzyme fails to adequately metabolize Phe, resulting in its accumulation in the bloodstream and leading to its potentially toxic metabolites. This pathophysiological condition, known as hyperphenylalaninemia (HPA), is the most severe form in PKU when pretreatment serum Phe levels are more than > 1200 μM. In PKU, excessive Phe and its toxic metabolites interfere with neurotransmission in the brain, causing neurological complications such as mental retardation and involuntary movements [1–3]. Symptoms of PKU typically arise within the first few months of life, though they can be subtle initially. If left untreated, elevated Phe levels can lead to severe intellectual disability, developmental delays, behavioral problems, and neurological complications such as seizures and microcephaly. Additionally, untreated individuals with PKU may exhibit lighter skin and hair compared to their family members due to decreased melanin synthesis [4]. The spectrum of symptoms can vary widely, with some individuals experiencing milder forms of the disorder that still require treatment to prevent cognitive impairment [3]. Once diagnosed, the treatment of PKU primarily requires a strict, lifelong dietary regimen designed to restrict Phe intake. Recent advancements in PKU treatment include sapropterin dihydrochloride (BH4) supplements that boosts enzyme activity in some patients, and other options such as LNAA supplements, which help lower Phe levels. These treatments represent avenues for enhancing the management of PKU beyond dietary restrictions alone [5–7].
The prevalence of PKU varies globally, with incidence rates typically around 1 in 10,000–15,000 births in the United States and Europe. However, in certain regions such as Türkiye and Ireland, higher prevalence rates have been observed, attributed to genetic factors and increased consanguinity [8–10]. PKU was the first disease to be identified by the neonatal screening programs. These population-based screening initiatives have significantly improved PKU detection and management, enabling early intervention and significantly reducing the disease burden [11].
Over the years, diagnostic methods for PKU have significantly evolved, from the simple ferric chloride urine test to the Guthrie test enabling the neonatal screening to become a pivotal public health measure in several countries. Screening typically involves the measurement of Phe levels in dried blood spots collected shortly after birth, followed by confirmatory testing using quantitative amino acid analysis or Sanger sequencing [12]. Initially the Guthrie test, a semiquantitative method with limited sensitivity was employed. This was subsequently replaced by more sensitive and precise techniques such as quantitative fluorometric methods, and consecutively by tandem mass spectrometry (MS) technology, which enhanced the accuracy and efficiency of screening programs [13].
In parallel, over the last decade, next-generation sequencing (NGS) systems have gained prominence in genetic diagnosis. Nevertheless, in practice, the diagnosis of PKU still primarily relies on blood Phe levels. Genetic analyses are conducted to assess tetrahydrobioterin (BH4) responsiveness, predict the clinical form of the disease, and differentiate BH4 metabolism disorders. NGS allows comprehensive analysis of the PAH gene, enabling the identification of known and novel mutations with high accuracy and speed which was not possible with MS/MS. Unlike traditional methods, this advancement has facilitated earlier detection, precise characterization of genetic variants, and tailored management strategies for individuals affected by PKU [14, 15]. However, despite their widespread adoption in neonatal screening, NGS can encounter challenges, especially in the time-consuming nature of the library preparation [16]. Third-generation NGS systems, exemplified by Oxford Nanopore Technologies (ONTs) present a promising solution to the limitations associated with earlier, second-generation NGS platforms [17]. ONT systems have the potential to reduce both costs and labor requirements, addressing efficiency concerns and enhancing the feasibility of genetic screening and diagnostic applications for conditions like PKU.
In the present study, our objective was to develop an ONT-compatible approach for single-gene testing in the molecular diagnosis of PKU. We employed an in-house panel specifically designed for screening clinical samples and evaluated its performance by comparing it with the Illumina sequencing platform. This approach aims to explore the potential of ONT systems in human genetics applications, particularly for diagnosing inherited metabolic disorders, thereby representing an innovative step towards advancing neonatal screening.
2. Methods
2.1. Study Design and Population
This study was designed as a method development and validation study. Then, 40 patients were recruited from the Istanbul University, Istanbul Medical Faculty, Department of Pediatrics, Division of Nutrition and Metabolism between 2021 and 2024 years for this study, which was structured as a test validation approach. The present study was approved by Istanbul Medical Faculty Ethics Committee under the number 23.11.2022-1397546. Verbal and written consents were obtained from each patient or their guardian. All patients had previously been diagnosed with PKU based on quantitative Phe results determined by high-performance liquid chromatography (HPLC), and molecular diagnoses had been conducted using an Illumina-based targeted panel. Prior to collecting peripheral blood samples for molecular studies, demographic information listed in Table 1 was obtained. Total genomic DNA was isolated from the participants blood using QIAmp DNA Blood Mini kit (Qiagen, Hilden, Germany) following the manufacturer's protocol.
2.2. Primer Design and Optimization
The primers designed to amplify the exons, untranslated regions (UTRs), and exon–intron boundaries of the PAH gene (NM_000277.3) in which pathogenic or likely pathogenic variants have already been reported (Figure 1) were generated using the NCBI Primer-BLAST tool [18]. A total of nine primer pairs were designed to produce amplicons of approximately 1500 to 5000 base pairs in size. Prior to pooling for patient sample sequencing, the designed primers underwent analysis using in silico PCR (Silica; Gear Genomics; https://www.gear-genomics.com/silica/). This computational approach enabled virtual PCR amplification of the target regions to predict and validate primer specificity and efficiency before experimental implementation.
For primer optimization, human reference DNA (Agilent Technologies, United States) was used as a positive control, while a negative control containing all reaction components except template DNA was included to assess potential contamination and ensure the specificity of the amplification. Each primer was used at a final concentration of 0.1 μM for amplification of target regions, using 2X Hi-Fi enzyme master mix (MobiomX, Türkiye) and 35 ng of DNA. The optimized thermal cycling profile included an initial denaturation at 95°C for 3 min, followed by 40 cycles of denaturation at 95°C for 15 s, annealing at 66°C for 45 s and extension at 72°C for 4 min; and a final extension step at 72°C for 10 min concluded the thermal cycling process. PCR products were visualized by gel electrophoresis on a 1% agarose gel, confirming the presence of the DNA bands of the expected size, while the negative control exhibited no amplification, validating the reaction's specificity. Subsequently, the primers were pooled into two tubes based on rational considerations. Following primer pooling, PCR was conducted using human reference DNA and negative controls. A representative image of the gel electrophoresis results is provided in Figure S1. The resulting amplicons were then subjected to sequencing. After sequencing, coverage and read depths were checked and primer pairs were exchanged between the pools to address any amplification issues, ensuring uniform coverage of the PAH gene. Patient samples were then screened using the optimized primer pools and sequencing protocols described above.
2.3. Library Preparation and ONT Sequencing
Sequencing libraries were prepared for this study using the following materials and equipment: SQK-LSK109, and EXP-NBD196 kits, FLO-MIN106D flow cell, and Mk1C device from ONTs (Oxford, United Kingdom). PCR amplicons were initially purified using AMPure XP beads (Beckman Coulter Life Sciences, United States), and subsequently, they underwent end-repair using the NEBNext Ultra II End-Repair/dA-tailing Module (New England Biolabs, United States), followed by barcoding. After another purification step to ensure high quality, the products were adapter-ligated using Adapter Mix II. After a final purification step and determination of the concentrations using Qubit 4 Fluorometer (Thermo Scientific, United States), 400 ng of the libraries was loaded onto the MinION Mk1c device and sequenced for a duration of 48 h.
2.4. Data Analysis
In the ONT platform, Guppy software was used for the basecalling and demultiplexing of sequencing data. Raw Fastq files underwent processing using Massive Analyser (Massive Bioinformatics, Türkiye), which included quality assessment using FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) embedded within the Massive Analyser program. To align reads to the human reference DNA sequence, an in-house bed file was utilized to correct alignment errors with MiniMap2 [19]. Medaka (https://github.com/nanoporetech/medaka) was then employed to correct the alignment errors. Variant calling was performed using Clair3-Trio [20], and subsequent variant annotation was conducted with ANNOVAR [21]. The identified variants were filtered using VarAFT software [22] and prioritized by Franklin by Genoox (https://franklin.genoox.com) according to the American College of Medical Genetics (ACMG) criteria [23]. Genotype–phenotype correlations were established using entries from the BioPKU database [24] and biochemical classification of PKU (classic, mild, or undefined), as previously described by Nalin et al. [25].
3. Results
3.1. Patients and the Demographic Information
The present study enrolled 40 patients diagnosed with PKU, whose molecular diagnosis had been previously conducted using Illumina-based targeted sequencing. Clinical diagnosis of PKU was done according to patient Phe levels, as classic PKU (Phe > 1200* μ*mol/L) and mild PKU (Phe 600–1200 μmol/L). Accordingly, the patient group included 31 classic PKU (77.5%) and 9 mild PKU (22.5%) patients. These patients were currently undergoing dietary treatment. Among these patients, 18 (45%) were female and 22 (55%) were male. Of the 40 patients, clinical diagnosis of 30 were in concordance with the BioPKU data. Their ages ranged between 1 and 37 years old, reflecting the diverse age distribution of follow-up patients. According to the questionnaire, which had missing data for four patients, it was found out that the parents of 17 patients (47.2%) were consanguineous (Table 1).
3.2. Assay Optimization and Sequencing Quality Assessment
The ONT-based PAH screening assay was developed to amplify the exons, UTRs, and exon–intron boundaries using 9 primer pairs. After primary and singularly evaluation of the primer pairs by human reference DNA, the primers were pooled into two tubes. Subsequently, these pooled primers underwent further validation using human reference DNA, involving gel electrophoresis and ONT-sequencing. The sequencing quality was assessed using FastQC, and read visualization and analysis were performed using the Integrative Genomics Viewer (IGV) tool [26]. According to the findings (Figure 2), all targeted genomic regions; exons, UTRs, and exon–intron boundaries, were effectively covered with varying read depths ranging from 50 to 2000x. Moreover, the read quality was observed to decrease slightly for sequences in the range of approximately 1500 to 5000 bps. However, this characteristic was deemed suitable and advantageous for subsequent screening of the patient samples.
3.3. Variant Evaluation
Following assay optimization using human reference DNA, the developed system was applied to screen PAH gene in PKU patients previously diagnosed using an Illumina-based sequencing panel. The results pointed out that the newly developed assay effectively detected all pathogenic variants already identified by the Illumina sequencing (Table 1). However, the allele frequency necessary to determine homozygosity was relatively low, specifically greater than or equal to 0.58.
Among the patients, the intronic variant, NC_000012.12 (NM_000277.3):c.1066-11G > A exhibited a high frequency, detected in 15 patients (15/40; 37.5%). Among these, six patients were homozygous, eight patients were compound heterozygous, and one patient was singularly heterozygous for this mutation. The second most frequently detected variant was NC_000012.12 (NM_000277.3):c.782G > A (6/40; 15%) found in six patients, with two patients demonstrating a homozygous pattern and four patients being heterozygous. Importantly, 17 patients (42.5%) had pathogenic/likely pathogenic variants in a compound heterozygous configuration, while three patients (7.5%) possessed a single heterozygous variant insufficient to explain the genetic basis of the disease.
4. Discussion
In the present study, we designed a novel in house amplification-based PAH-screening panel adaptable to one of the long-read sequencing platforms, ONT. Pooling nine primer pairs covering all exonic, splicing, and mutation-reported deep intronic regions and UTRs of the PAH gene into two tubes, we successfully optimized the system using reference human DNA. By optimized system, we screened 40 PKU patients who were already molecularly diagnosed by Illumina platform. Our results pointed out a 100% correlation with clinical reports of the patients, emphasizing the success rate of the in-house ONT system in clinical genetics in terms of molecular diagnosis of PKU.
Early and accurate diagnosis of PKU is crucial given its treatable nature. Biochemical and analytical methods are typically applied for the neonatal screening of PKU, making it a standard inclusion in neonatal screening programs across many countries [27]. However, conventional screening methods may be misleading, particularly when the levels of Phe are elevated due to other inherited metabolic disorders involving deficiencies in tetrahydrobiopterin metabolism [3]. MS/MS is the method of choice for neonatal screening, yet the high ratio of false positive results significantly restricts its applicability [28]. Thus, molecular diagnosis plays a pivotal role in ensuring accurate differential diagnosis of PKU. Thus, sequencing PAH gene using an NGS platform offers distinct advantages in terms of rapid, cost-effective, and accurate screening for PKU. By biochemical or analytical methods, PKU can be diagnosed; nonetheless, the severity and treatment of the disease could be heterogenous depending on the affected member of the tetrahydrobiopterin metabolism. Thus, analytical methods such as MS/MS could primarily be used for the diagnosis of PKU; nevertheless, molecular methods including NGS as a second-tier testing are required for differential diagnosis and accordingly proper treatment approaches [29]. Among the NGS platforms, so-called second-generation systems are predominantly used systems for the molecular diagnosis of PKU [30, 31]. However, owing to having a feature to read the target gene regions as small fragments, these systems may fail to detect large deletions and/or duplications reported with high frequencies in the PAH gene [32]. Moreover, the establishment of the NGS sequencing facilities and the associated per-sample cost can be relatively high, posing challenges for equal accessibility in local regions, and potentially limiting compliance with regulatory requirements for the neonatal screening programs [33, 34]. ONT platform offers a practical and effective alternative in terms of time and cost burden in diagnostic screening [35]. Hence, we study long-read sequencing capacity of ONT to improve differential diagnosis of PKU for adoption in second-tier genetic analysis.
According to the results obtained in the study, the amplicon sizes aligned with the expectations set by the primer designing strategy, typically ranging from 1500 to 5000 bp. This alignment was confirmed through both the ONT sequencing and the gel electrophoresis. All targeted regions, including exons, UTRs, and exon–intron boundaries of the PAH gene, were successfully covered by our assay. However, the quality scores of the reads obtained from ONT sequencing were lower compared to the data generated by Illumina platforms (http://www.bioinformatics.babraham.ac.uk), which is a known characteristic of ONT sequencing. Despite this, the read qualities were relatively high when compared to the studies reported in existing literature [36]. Hence, we regarded to further validate the developed system by the patient samples.
The validation of the system using patient samples, who were previously diagnosed using an Illumina-based assay, focused on detecting the pathogenic/likely pathogenic variants. According to the results, the read depths across the target regions varied, and adjustments in primer design and/or input concentrations did not resolve this variability, potentially attributed to the challenges associated with long PCR applications [37]. Nevertheless, the minimum read depth exceeded 50x which was deemed adequate for the molecular analyses [38]. Importantly, the variants that were detected by the developed ONT-based systems showed a 100% correlation with those identified by the Illumina-based assay. When comparing variant allele frequencies between the two systems, the threshold for the determination of the homozygosity was set at ≥ 0.58, which is relatively low. However, this threshold accurately classified all the evaluated variants.
Among the variants identified in the study, the intronic mutation, NC_000012.12 (NM_000277.3):c.1066-11G > A was the most prevalent, detected in 15 out of 40 patients (37.5%). The second most frequently identified variant was NC_000012.12 (NM_000277.3):c.782G > A, found in 6 out of 40 patients (15%). Similar studies in the PKU patients from Türkiye underlined high frequencies of those variants, albeit with varying ratios [39–41]. Compound heterozygosity was more common than homozygosity, indicating that many patients carried different variants on each allele of the PAH gene. In three patients, a second variant necessary to establish compound heterozygosity could not be detected. This scenario suggests the presence of structural or deep intronic variants that were not covered by the sequencing systems used in this study. Finally, the incidence of PKU in Türkiye is relatively high, estimated at 1 in 6667 births [42]. This elevated incidence may be attributed to the high rate of consanguineous marriages among the parents of the patients, observed in 17 out of 40 patients (47.2%) in the present study. Consanguinity increases the likelihood of inherited recessive disorders like PKU manifesting in the offspring due to the increased chance of both parents carrying the same pathogenic variants.
5. Conclusion
We have developed and validated a novel in-house designed ONT-based assay for PKU diagnosis, achieving 100% concordance with Illumina-based results despite ONT's lower read quality. With comprehensive PAH gene coverage, long-read capabilities, and cost-effectiveness, the assay is well-suited for high-prevalence regions like Türkiye, where consanguinity increases PKU risk. The assay's rapid and reliable results support timely intervention, potentially improving neonatal screening and patient outcomes, especially in resource-limited settings. However, further cohort studies are needed to fully assess its efficacy in detecting structural and deep intronic variants due to the lack of such cases in the present study.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Regier D. S. Greene C. L. Phenylalanine Hydroxylase Deficiency Gene Reviews [Internet], Jan. 2017, Accessed: Jun. 10, 2024. [Online]. Available: https://www.ncbi.nlm.nih.gov/books/NBK 1504/
- 2Flydal M. I. Martinez A. Phenylalanine Hydroxylase: Function, Structure, and Regulation IUBMB Life 201365434134910.1002/IUB.11502-s 2.0-8487535325623457044 · doi ↗ · pubmed ↗
- 3van Spronsen F. J. Blau N. Harding C. Burlina A. Longo N. Bosch A. M. Phenylketonuria Nature Reviews Disease Primers 20217110.1038/S 41572-021-00267-034017006 PMC 8591558 · doi ↗ · pubmed ↗
- 4Vockley J. Andersson H. C. Antshel K. M. Phenylalanine Hydroxylase Deficiency: Diagnosis and Management Guideline Genetics in Medicine 201416218820010.1038/GIM.2013.1572-s 2.0-8489842242624385074 · doi ↗ · pubmed ↗
- 5Longo N. Siriwardena K. Feigenbaum A. Long-Term Developmental Progression in Infants and Young Children Taking Sapropterin for Phenylketonuria: A Two-Year Analysis of Safety and Efficacy Genetics in Medicine 201517536537310.1038/GIM.2014.1092-s 2.0-8492888017225232857 · doi ↗ · pubmed ↗
- 6Scala I. Riccio M. P. Marino M. Bravaccio C. Parenti G. Strisciuglio P. Large Neutral Amino Acids (LNA As) Supplementation Improves Neuropsychological Performances in Adult Patients with Phenylketonuria Nutrients 2020124 p. 109210.3390/NU 1204109232326614 PMC 7230959 · doi ↗ · pubmed ↗
- 7Scala I. Brodosi L. Rovelli V. Noto D. Burlina A. Management of Patients With Phenylketonuria (PKU) Under Enzyme Replacement Therapy: An Italian Model (Expert Opinion) Molecular Genetics and Metabolism Reports 20243910106510.1016/J.YMGMR.2024.10106538425869 PMC 10899016 · doi ↗ · pubmed ↗
- 8El-Metwally A. Yousef A.-A. L. Ayman S. A. The Prevalence of Phenylketonuria in Arab Countries, Turkey, and Iran: A Systematic Review Bio Med Research International 201820181769721010.1155/2018/76972102-s 2.0-8504869797829850564 PMC 5932526 · doi ↗ · pubmed ↗
