# HPTAS: An Alignment-Free Haplotype Phasing Algorithm Focused on Allele-Specific Studies Using Transcriptome Data

**Authors:** Jianan Wang, Zhenyuan Sun, Guohua Wang, Yan Miao

PMC · DOI: 10.3390/ijms26125700 · International Journal of Molecular Sciences · 2025-06-13

## TL;DR

HPTAS is a new alignment-free method for haplotype phasing that improves accuracy in allele-specific studies using RNA-seq data.

## Contribution

HPTAS introduces a k-mer-based, alignment-free approach for haplotype phasing focused on exonic SNPs in allele-specific studies.

## Key findings

- HPTAS achieves high phasing accuracy and performance on both NGS and TGS RNA-seq data.
- Transcriptome data significantly improves the phasing of exonic SNPs in allele-specific studies.
- HPTAS outperforms existing methods in haplotype phasing for allele-specific expression and splicing analyses.

## Abstract

Haplotype phasing refers to determining the haplotype sequences inherited from each parent in a diploid organism. It is a critical process for various downstream analyses, and numerous haplotype phasing methods for genomic single nucleotide polymorphisms (SNPs) have been developed. Allele-specific (AS) expression and alternative splicing play key roles in diverse biological processes. AS studies usually focus more on exonic SNPs, and multiple phased SNPs need to be combined to obtain better inferences. In this paper, we introduce an alignment-free algorithm HPTAS for haplotype phasing in AS studies. Instead of using sequence alignment to count the number of reads covering SNPs, HPTAS constructs a mapping structure from transcriptome annotations and SNPs and employs a k-mer-based approach to derive phasing counts from RNA-seq data. Using both next-generation sequencing (NGS) and the third-generation sequencing (TGS) NA12878 RNA-seq data and comparing with the most advanced algorithm in the field, we have demonstrated that HPTAS achieves high phasing accuracy and performance and that transcriptome data indeed facilitates the phasing of exonic SNPs. With the continued advancement of sequencing technology and the improvement in transcriptome annotations, HPTAS may serve as a foundation for future haplotype phasing methods.

## Full-text entities

- **Genes:** PPIE (peptidylprolyl isomerase E) [NCBI Gene 10450] {aka CYP-33, CYP33, CypE}
- **Diseases:** AS (MESH:D000080888), injury to (MESH:D014947), TGS (MESH:D015840)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** 40218695 (A/G)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12193515/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12193515/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12193515/full.md

---
Source: https://tomesphere.com/paper/PMC12193515