# SquiDBase: a community resource of raw nanopore data from microbes

**Authors:** Wim L Cuypers, Halil Ceylan, Eline Turcksin, Laura Raes, Nicky de Vrij, Johan Michiels, Sandra Coppens, Tessa de Block, Daan Jansen, Kevin K Ariën, Philippe Selhorst, Koen Vercauteren, Julia M Gauglitz, Wout Bittremieux, Kris Laukens

PMC · DOI: 10.1093/nargab/lqaf213 · NAR Genomics and Bioinformatics · 2026-01-08

## TL;DR

SquiDBase (https://squidbase.org) is an open-access repository for sharing raw nanopore sequencing data from microbes, intended to improve reproducibility and to support the development and benchmarking of computational tools.

## Contribution

The paper introduces SquiDBase, a centralized repository for raw microbial nanopore sequencing data with linked processed data and metadata, and SquiDPipe, a Nextflow pipeline that automatically removes human reads from raw nanopore data.

## Key findings

- SquiDBase includes raw data for 24 clinically relevant viruses sequenced by the authors, along with publicly available reference datasets and new community contributions.
- SquiDPipe, a Nextflow pipeline, automates the removal of human reads from raw nanopore data.
- SquiDBase promotes open science by enabling standardized sharing of raw nanopore data.
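
The human-read removal step listed above can be pictured as a filter keyed on read IDs. The following is a minimal Python sketch of that idea, not SquiDPipe's actual implementation (which is a Nextflow pipeline operating on raw nanopore files). It assumes that reads have already been basecalled and aligned, and that each raw signal can be looked up by read ID. The function names, contig names, and toy data are all hypothetical.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical contig names standing in for a human reference genome.
HUMAN_CONTIGS: Set[str] = {"chr1", "chr2", "chrX"}


def human_read_ids(
    alignments: Iterable[Tuple[str, str]], human_contigs: Set[str]
) -> Set[str]:
    """Return IDs of reads with at least one alignment to a human contig."""
    return {read_id for read_id, contig in alignments if contig in human_contigs}


def drop_human_reads(
    raw_signals: Dict[str, List[int]], flagged: Set[str]
) -> Dict[str, List[int]]:
    """Keep only raw signal records whose read ID was not flagged as human."""
    return {rid: sig for rid, sig in raw_signals.items() if rid not in flagged}


# Toy raw signals keyed by read ID (assumed layout, for illustration only).
raw_signals = {
    "read-a": [512, 498, 530],
    "read-b": [610, 605, 590],
    "read-c": [450, 470, 455],
}

# Toy (read ID, contig) alignment pairs; "read-c" has no alignment at all.
alignments = [
    ("read-a", "chr1"),
    ("read-b", "virus_segment_1"),
    ("read-a", "virus_segment_1"),
]

flagged = human_read_ids(alignments, HUMAN_CONTIGS)
microbial_signals = drop_human_reads(raw_signals, flagged)
print(sorted(microbial_signals))
```

In this sketch a read is discarded if any of its alignments hits a human contig, and unaligned reads are kept. That is a conservative design choice made for the example; the paper's pipeline may apply different criteria.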

## Abstract

Nucleotide sequences in the FASTQ or BAM format are widely shared, yet derived from platform-specific raw data outputs that differ across sequencing platforms. In Oxford Nanopore Technologies (ONT) sequencing, raw signal data contain valuable biological information and enable basecaller optimization and modification detection. These raw signals also underpin algorithms that could improve ONT device portability and enhance target enrichment efficiency through adaptive sampling. Nevertheless, the storage and sharing of raw nanopore data remain limited due to technical constraints and the lack of standardized and centralized infrastructure. To address this challenge, we developed SquiDBase (https://squidbase.org), a dedicated repository for raw microbial nanopore sequencing data with linked processed data and metadata. To maximize immediate utility, we built SquiDPipe, a Nextflow pipeline for the automated removal of human reads from raw nanopore data, sequenced 24 clinically relevant viruses and incorporated them into SquiDBase, and added publicly available reference datasets and new community contributions. By offering a centralized, open-access raw data collection platform, SquiDBase facilitates data sharing, enhances reproducibility, and supports the development and benchmarking of computational tools, reinforcing open science in nanopore sequencing.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12783041/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12783041/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12783041/full.md

---
Source: https://tomesphere.com/paper/PMC12783041