# GGC repeat expansions within new open reading frames are translated into toxic polyglycine proteins in oculopharyngodistal myopathy

**Authors:** Manon Boivin, Jiaxi Yu, Nobuyuki Eura, Léa Schmitt, David Pietri, Erwan Grandgirard, Patrice Goetz-Reiner, Damien Plassard, Chadia Nahy, Anne Maglott, Bastien Morlet, Chao Gao, Elise Lefebvre, Muriel Philipps, Pascal Eberling, Angélique Pichot, Paola Rossolillo, Christelle Thibault, Mustapha Oulad-Abdelghani, Ichizo Nishino, Kang Yang, Ning Wang, Zhaoxia Wang, Jianwen Deng, Nicolas Charlet-Berguerand

PMC · DOI: 10.1038/s41588-026-02507-z · Nature Genetics · 2026-02-17

## TL;DR

This study shows that GGC repeat expansions in previously unknown DNA regions cause toxic proteins linked to a rare muscle and brain disease.

## Contribution

The discovery that GGC expansions in small ORFs produce toxic polyglycine proteins is novel and expands understanding of disease mechanisms.

## Key findings

- GGC expansions in new ORFs are translated into polyglycine proteins that form p62-positive inclusions.
- Polyglycine proteins cause muscle and neurological issues in multiple model systems.
- The compound TMPyP4 reduces expression of these toxic proteins, suggesting a potential therapy.

## Abstract

A total of 3–6% human genome is composed of microsatellite sequences, which are short DNA elements composed of two to six nucleotide motifs repeated in tandem. Expansion of a subset of these microsatellites is the leading cause of >60 diseases. However, most of these mutations are located in sequences annotated as noncoding, which raises questions about their pathogenicity. Here we found that GGC repeat expansions causing oculopharyngodistal myopathy with or without oculopharyngeal myopathy leukoencephalopathy are located within previously unrecognized open reading frames (ORFs), resulting in their translation into new polyglycine-containing proteins. Antibodies developed against these proteins stain the p62-positive inclusions typical of these diseases. Moreover, expression of these polyglycine proteins causes locomotor and skeletal muscle alterations associated with neurodegeneration in cell, fly and mouse models. Finally, we identified a compound, the cationic porphyrin TMPyP4, targeting the expression of these polyglycine proteins, raising hope to develop a therapy for these disorders. Overall, this work highlights the complexity and richness of the human genome and the importance of mutations in yet-unrecognized small ORFs.

GGC repeat expansions causing oculopharyngeal myopathy with or without leukoencephalopathy are located in small open reading frames and translated into polyglycine proteins that form cellular aggregates, driving neuronal and muscle cell dysfunction.

## Linked entities

- **Proteins:** GTF2H1 (general transcription factor IIH subunit 1)
- **Diseases:** oculopharyngodistal myopathy (MONDO:0020793)

## Full-text entities

- **Genes:** NUP62 (nucleoporin 62) [NCBI Gene 23636] {aka IBSN, SNDI, p62}
- **Diseases:** oculopharyngeal myopathy (MESH:D039141), leukoencephalopathy (MESH:D056784), oculopharyngodistal myopathy (MESH:C563508), neurodegeneration (MESH:D019636)
- **Chemicals:** TMPyP4 (MESH:C021096), polyglycine (MESH:C011080), porphyrin (MESH:D011166)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12987736/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12987736/full.md

---
Source: https://tomesphere.com/paper/PMC12987736