# Haplotype Inference Using Long-Read Nanopore Sequencing: Application to GSTA1 Promoter

**Authors:** Vid Mlakar, Isabelle Dupanloup, Yvonne Gloor, Marc Ansari

PMC · DOI: 10.1007/s12033-024-01213-7 · Molecular Biotechnology · 2024-06-17

## TL;DR

This study shows that long-read nanopore sequencing can accurately determine GSTA1 promoter haplotypes without inference, improving clinical haplotype recovery.

## Contribution

Demonstrates the efficacy of Oxford nanopore sequencing for accurate haplotype phasing in the GSTA1 promoter region.

## Key findings

- Nanopore sequencing achieved >90% correct haplotype recovery for SNPs within 200 bp.
- Sequencing accuracy dropped to 58% for SNPs 1089 bp apart, showing distance dependence.
- Hybrid haplotypes were influenced by the number of PCR cycles, not extension or annealing time.

## Abstract

Recovering true haplotypes can have important clinical consequences. The laboratory process is difficult and is, therefore, most often done through inference. In this paper, we show that when using the Oxford nanopore sequencing technology, we could recover the true haplotypes of the GSTA1 promoter region. Eight LCL cell lines with potentially ambiguous haplotypes were used to characterize the efficacy of Oxford nanopore sequencing to phase the correct GSTA1 promoter haplotypes. The results were compared to Sanger sequencing and inferred haplotypes in the 1000 genomes project. The average read length was 813 bp out of a total PCR length of 1336 bp. The best coverage of sequencing was in the middle of the PCR product and decreased to 50% at the PCR ends. SNPs separated by less than 200 bp showed > 90% of correct haplotypes, while at the distance of 1089 bp, this proportion still exceeded 58%. The number of cycles influences the generation of hybrid haplotypes but not extension or annealing time. The results demonstrate that this long sequencing reads methodology, can accurately determine the haplotypes without the need for inference. The technology proved to be robust but the success of phasing nonetheless depends on the distances and frequencies of SNPs.

## Linked entities

- **Genes:** GSTA1 (glutathione S-transferase alpha 1) [NCBI Gene 2938]

## Full-text entities

- **Genes:** GSTA1 (glutathione S-transferase alpha 1) [NCBI Gene 2938] {aka GST-epsilon, GST2, GSTA1-1, GTH1}

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12055866/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12055866/full.md

## References

3 references — full list in the complete paper: https://tomesphere.com/paper/PMC12055866/full.md

---
Source: https://tomesphere.com/paper/PMC12055866