# ASET: An end-to-end pipeline for quantification and visualization of allele specific expression

**Authors:** Weisheng Wu, Kerby Shedden, Claudius Vincenz, Chris Gates, Beverly Strassmann

PMC · DOI: 10.21203/rs.3.rs-6844336/v1 · 2025-06-13

## TL;DR

ASET is an end-to-end pipeline for SNP-level allele-specific expression (ASE) analysis from RNA-Seq data. It combines a Nextflow workflow for quantification, an R library for visualization, and a Julia script for parent-of-origin (PofO) testing.

## Contribution

ASET introduces a modular, end-to-end pipeline for ASE analysis that integrates tools for quantification, visualization, and parent-of-origin (PofO) testing.

## Key findings

- ASET includes a Nextflow pipeline for SNP-level ASE quantification.
- ASET provides an R library for data visualization and a Julia script for PofO testing.
- ASET handles read quality control, alignment, counting, annotation, and contamination estimation.
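
To make "SNP-level ASE quantification" concrete, here is a minimal Python sketch of what can be computed from the reference and alternate read counts at one heterozygous SNP: the reference-allele fraction and an exact two-sided binomial test against balanced expression. This is an illustration only. The function names are hypothetical, and the binomial test is a common baseline for allelic imbalance, not a method this summary attributes to ASET.

```python
from math import comb


def allelic_ratio(ref_count: int, alt_count: int) -> float:
    """Fraction of reads at a SNP that carry the reference allele."""
    total = ref_count + alt_count
    if total == 0:
        raise ValueError("no reads cover this SNP")
    return ref_count / total


def binomial_two_sided_p(ref_count: int, alt_count: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value for allelic imbalance.

    Assumption (not from the paper): under balanced expression each read
    carries the reference allele with probability p = 0.5.
    """
    n = ref_count + alt_count
    # Probability of every possible reference-read count 0..n
    pmf = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    observed = pmf[ref_count]
    # Sum the outcomes that are no more likely than the observed one;
    # the small tolerance guards against floating-point ties.
    p_value = sum(x for x in pmf if x <= observed * (1 + 1e-9))
    return min(1.0, p_value)


if __name__ == "__main__":
    ref, alt = 30, 10  # made-up counts for illustration
    print(allelic_ratio(ref, alt), binomial_two_sided_p(ref, alt))
```

In practice, read counts are often overdispersed relative to a binomial model, so a sketch like this is best read as a first-pass screen rather than a final test.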

## Abstract

Allele-specific expression (ASE) analyses from RNA-Seq data provide quantitative insights into imprinting and genetic variants affecting transcription. Robust ASE analysis requires the integration of multiple computational steps, including read alignment, read counting, data visualization, and statistical testing. This complexity creates challenges around reproducibility, scalability, and ease of use.

Here, we present ASE Toolkit (ASET), an end-to-end pipeline that streamlines SNP-level ASE data generation, visualization, and testing for parent-of-origin (PofO) effects. ASET includes a modular pipeline built with Nextflow for ASE quantification from short-read transcriptome sequencing data, an R library for data visualization, and a Julia script for PofO testing. ASET performs comprehensive read quality control, SNP-tolerant alignment to reference genomes, read counting with allele and strand resolution, annotation with genes and exons, and estimation of contamination. In sum, ASET provides a complete and easy-to-use solution for molecular and biomedical scientists to identify and interpret patterns in ASE from RNA-Seq data.
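
As a rough illustration of the idea behind PofO analysis, the Python sketch below assigns the two alleles of a heterozygous SNP to the mother or father using trio genotypes, then reports the maternal fraction of reads. It assumes parental genotypes are available and that all names (`maternal_allele`, `maternal_fraction`) are hypothetical. It does not reproduce ASET's Julia script, whose statistical model is not described in this summary.

```python
from typing import Dict, Optional, Tuple

Genotype = Tuple[str, str]


def maternal_allele(child: Genotype, mother: Genotype, father: Genotype) -> Optional[str]:
    """Return the allele the child inherited from the mother, or None if ambiguous."""
    a, b = child
    if a == b:
        return None  # homozygous child: the two alleles cannot be told apart
    for mat, pat in ((a, b), (b, a)):
        this_ok = mat in mother and pat in father   # this assignment is possible
        other_ok = pat in mother and mat in father  # the swapped one is possible
        if this_ok and not other_ok:
            return mat
    # Both assignments possible (e.g. both parents heterozygous) or neither
    # possible (Mendelian inconsistency): the SNP is uninformative.
    return None


def maternal_fraction(
    counts: Dict[str, int], child: Genotype, mother: Genotype, father: Genotype
) -> Optional[float]:
    """Fraction of allele-resolved reads that come from the maternal allele."""
    mat = maternal_allele(child, mother, father)
    if mat is None:
        return None
    total = counts.get(child[0], 0) + counts.get(child[1], 0)
    if total == 0:
        return None
    return counts.get(mat, 0) / total


if __name__ == "__main__":
    # Made-up example: mother is A/A, father is A/G, child is A/G,
    # so the child's A allele must be maternal.
    print(maternal_fraction({"A": 30, "G": 10}, ("A", "G"), ("A", "A"), ("A", "G")))
```

A maternal fraction far from 0.5 that is consistent across informative SNPs and individuals would be the kind of pattern a PofO test looks for; a formal test would also need to model read depth and between-sample variation.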

## Full-text entities

- **Genes:** STAR (steroidogenic acute regulatory protein) [NCBI Gene 6770] {aka STARD1}, RHOBTB3 (Rho related BTB domain containing 3) [NCBI Gene 22836], WAS (WASP actin nucleation promoting factor) [NCBI Gene 7454] {aka IMD2, SCNX, THC, THC1, WASP, WASPA}
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12204500/full.md

---
Source: https://tomesphere.com/paper/PMC12204500