# Long-read sequencing disentangles isoform complexity at allele-specific loci

**Authors:** Lison Lemoine, Sarah Hoelzl, Tim P. Hasenbein, Elisabeth Graf, Daniel Andergassen

PMC · DOI: 10.1038/s41598-025-97362-z · Scientific Reports · 2025-11-11

## TL;DR

Long-read sequencing helps understand complex gene isoforms by distinguishing between alleles, offering new insights into gene regulation.

## Contribution

A workflow combining PacBio Iso-Seq and WhatsHap enables allele-specific isoform analysis in polymorphic hybrids.

## Key findings

- Long-read sequencing confirmed imprinting status of known imprinted genes in F1 mouse hybrids.
- The Gnas locus's complex isoform structure was benchmarked using long-read data.
- Isoforms from both active and inactive X chromosomes were detected for genes escaping X inactivation.

## Abstract

In recent years, long-read sequencing technologies have detected transcript isoforms with unprecedented accuracy and resolution. However, it remains unclear whether long-read sequencing can effectively disentangle the isoform landscape of complex allele-specific loci that arise from genetic or epigenetic differences between alleles. Here, we combine the PacBio Iso-Seq workflow with the established phasing approach WhatsHap to assign long reads to the corresponding allele in polymorphic F1 mouse hybrids. Upon comparing the long-read sequencing results with matched short reads, we observed general consistency in the allele-specific information and were able to confirm the imprinting status of known imprinted genes. We then explored the complex imprinted Gnas locus known for allele-specific non-coding and coding isoforms and were able to benchmark historical observations. This approach also allowed us to detect isoforms from both the active and inactive X chromosomes of genes that escape X chromosome inactivation. The described workflow offers a promising framework and demonstrates the power of long-read transcriptomic data to provide mechanistic insight into complex allele-specific loci.

## Linked entities

- **Genes:** GNAS (GNAS complex locus) [NCBI Gene 2778]
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** Gnas (GNAS complex locus) [NCBI Gene 14683] {aka 5530400H20Rik, A930027G11Rik, C130027O20Rik, GPSA, GSP, Galphas}
- **Species:** Mus musculus (house mouse, species) [taxon 10090]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12606347/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12606347/full.md

## References

4 references — full list in the complete paper: https://tomesphere.com/paper/PMC12606347/full.md

---
Source: https://tomesphere.com/paper/PMC12606347