# Analysis of Short Tandem Repeat Expansions in a Cohort of 12,496 Exomes from Patients with Neurological Diseases Reveals Variable Genotyping Rate Dependent on Exome Capture Kits

**Authors:** Clarissa Rocca, David Murphy, Chris Clarkson, Matteo Zanovello, Delia Gagliardi, Queen Square Genomics, Rauan Kaiyrzhanov, Javeria Alvi, Reza Maroofian, Stephanie Efthymiou, Tipu Sultan, Jana Vandrovcova, James Polke, Robyn Labrum, Henry Houlden, Arianna Tucci

PMC · DOI: 10.3390/genes16020169 · Genes · 2025-01-28

## TL;DR

This study evaluates how well exome sequencing can detect DNA repeat expansions in neurological diseases, finding that results depend on the capture kit used.

## Contribution

The study provides insights into the variable genotyping rates of repeat expansions based on exome capture kits and sequencing read lengths.

## Key findings

- 29 repeat expansions were identified and validated in 12,496 exomes, with 48% being diagnostic.
- Genotyping rates varied significantly depending on the exome capture kit and genomic location of the repeats.
- Some intronic repeats, like NOP56 and DMPK, showed high genotyping rates despite general low performance in intronic regions.

## Abstract

Background/Objectives: Short tandem repeat expansions are the most common cause of inherited neurological diseases. These disorders are clinically and genetically heterogeneous, such as in myotonic dystrophy and spinocerebellar ataxia, and they are caused by different repeat motifs in different genomic locations. Major advances in bioinformatic tools used to detect repeat expansions from short read sequencing data in the last few years have led to the implementation of these workflows into next generation sequencing pipelines in healthcare. Here, we aimed to evaluate the clinical utility of analysing repeat expansions through exome sequencing in a large cohort of genetically undiagnosed patients with neurological disorders. Methods: We here analyse 27 disease-causing DNA repeats found in the coding, intronic and untranslated regions in 12,496 exomes in patients with a range of neurogenetic conditions. Results: We identified—and validated by polymerase chain reaction—29 repeat expansions across a range of loci, 48% (n = 14) of which were diagnostic. We then analysed the genotyping performance across all repeat loci and found that, despite high coverage in most repeats in coding regions, some loci had low genotyping rates, such as those that cause spinocerebellar ataxia 2 (ATXN2, 0.1–8.4%) and Huntington disease (HTT, 0.2–58.2%), depending on the capture kit. Conversely, while most intronic repeats were not genotyped, we found a high genotyping rate in the intronic locus that causes spinocerebellar ataxia 36 (NOP56, 30.1–98.3%) and in the one that causes myotonic dystrophy type 1 (DMPK, myotonic dystrophy type 1). Conclusions: We show that the key factors that influence the genotyping rate of repeat expansion loci analysis are the sequencing read length and exome capture kit. These results provide important information about the performance of exome sequencing as a genetic test for repeat expansion disorders.

## Linked entities

- **Genes:** ATXN2 (ataxin 2) [NCBI Gene 6311], HTT (huntingtin) [NCBI Gene 3064], NOP56 (NOP56 ribonucleoprotein) [NCBI Gene 10528], DMPK (DM1 protein kinase) [NCBI Gene 1760]
- **Diseases:** myotonic dystrophy (MONDO:0016107), spinocerebellar ataxia (MONDO:0000437), Huntington disease (MONDO:0007739)

## Full-text entities

- **Genes:** DMPK (DM1 protein kinase) [NCBI Gene 1760] {aka DM, DM1, DM1PK, DMK, MDPK, MT-PK}, NOP56 (NOP56 ribonucleoprotein) [NCBI Gene 10528] {aka NOL5A, SCA36}, ATXN2 (ataxin 2) [NCBI Gene 6311] {aka ATX2, SCA2, TNRC13}
- **Diseases:** inherited neurological diseases (MESH:D030342), myotonic dystrophy (MESH:D009223), HTT (MESH:D006816), Neurological Diseases (MESH:D020271), neurological disorders (MESH:D009461), spinocerebellar ataxia (MESH:D020754)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11855749/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11855749/full.md

## References

12 references — full list in the complete paper: https://tomesphere.com/paper/PMC11855749/full.md

---
Source: https://tomesphere.com/paper/PMC11855749