# NanoPrePro: a fully equipped, fast, and memory-efficient preprocessor for nanopore transcriptomic sequencing

**Authors:** Chia-Chen Chu, Jhong-He Yu, Shang-Che Kuo, Fan-Wei Yang, Chia-Chang Lin, Chang-Hung Chen, Yi-Chen Wu, Cing Shih, Ying-Hsuan Sun, Te-Lun Mai, Ying-Lan Chen, Hsin-Hung Lin, Jung-Chen Su, Ying-Chung Jimmy Lin

PMC · DOI: 10.1093/bib/bbag063 · 2026-02-13

## TL;DR

NanoPrePro is a fast and efficient tool for processing nanopore sequencing data, offering better performance than existing methods.

## Contribution

NanoPrePro introduces a self-optimizing function and is significantly faster and more memory-efficient than current tools.

## Key findings

- NanoPrePro outperforms Pychopper in simulated and real datasets.
- It is 38 times faster with lower memory usage.
- The tool offers customizable parameters for better precision in read preprocessing.

## Abstract

NanoPrePro is a streamlined read preprocessor specifically designed for high precision in identifying full-length reads from Oxford Nanopore Technology (ONT) transcriptomic sequencing results, achieved through the precise identification of adapters/primers. However, the preprocessing of ONT reads has been a long-term neglected and ambiguous area without thorough and systematic investigation. Here, we developed NanoPrePro that outperformed the current best preprocessor, Pychopper, using simulated and real datasets. Through sequence similarity, adapter/primer location, and adapter/primer length, NanoPrePro exerted a self-optimizing function to extract the best parameters in each sequencing file for users to customize their analyses. Furthermore, NanoPrePro shows a 38-times faster speed with less memory cost. NanoPrePro can be regarded as the state-of-the-art preprocessor with forward adaptability of ONT sequencing.

## Full-text entities

- **Diseases:** APs (MESH:D018420), cancer (MESH:D009369), ONT (MESH:C000719218)
- **Chemicals:** PCS111 (-)
- **Species:** Eucalyptus grandis (rose gum, species) [taxon 71139], Liriodendron chinense (species) [taxon 3414], Mus musculus (house mouse, species) [taxon 10090], Populus (poplar, genus) [taxon 3689], Populus trichocarpa (black cottonwood, species) [taxon 3694], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** A49A
- **Cell lines:** 293T — Homo sapiens (Human), Transformed cell line (CVCL_0063), Ptr_111 — Sus scrofa (Pig), Spontaneously immortalized cell line (CVCL_YB17)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12903951/full.md

---
Source: https://tomesphere.com/paper/PMC12903951