# Long-Read Sequencing Outperforms Short-Read Sequencing in Detecting Most Structural Variations

**Authors:** Xinyue Chen, Xiaodong Lu, Xianglin Shi, Shaojun Yu, Jonathan Zhao

PMC · DOI: 10.17161/sjm.v2i2.23671 · Serican journal of medicine · 2026-02-09

## TL;DR

Long-read sequencing detects most structural variations in cancer genomes better than short-read sequencing, except for very long deletions.

## Contribution

A direct comparison of long-read and short-read sequencing for structural variation detection in a cancer genome.

## Key findings

- Long-read sequencing detects insertions and small deletions more accurately than short-read sequencing.
- Short-read sequencing is better at detecting long deletions due to paired-end read advantages.
- Long-read sequencing provides more precise breakpoints and reduces errors in repetitive regions.

## Abstract

Structural variations (SV) are common in the cancer genome and play critical roles in regulating tumorigenesis. In the past decades, many SVs have been detected through analyses of whole-genome sequencing (WGS) data generated mainly by Illumina paired-end short-read sequencing (SRS). Recent advances in long-read sequencing (LRS) techniques provide exciting opportunities for SV detection. However, a comprehensive analysis of the pros and cons of LRS and SRS in detecting SVs in a cancer genome is still lacking. Here, we performed WGS of the LNCaP prostate cancer cell line through LRS using the Oxford Nanopore Technology and called main SVs, which were compared to those derived from publicly available LNCaP SRS data. Strikingly, LRS is superior in detecting insertions of all sizes and deletions of <1000 bp long, whereas SRS is very useful in capturing long deletions, taking advantage of its paired-end reads. LRS identified more precise breakpoints of detected SVs. In addition, we found that SRS called many duplications and inversions, most of which were not confirmed by LRS, likely due to ambiguity in SRS read alignment to repetitive regions, leading to errors in SV calling. In conclusion, LRS outperformed SRS in detecting most SVs, except deletions longer than LRS read lengths. Our study highlights the advantages of LRS in resolving complex genomic rearrangements and underscores its potential for improving SV detection in cancer genomics.

## Linked entities

- **Diseases:** cancer (MONDO:0004992)

## Full-text entities

- **Genes:** MIPOL1 (mirror-image polydactyly 1) [NCBI Gene 145282] {aka CCDC193}, AR (androgen receptor) [NCBI Gene 367] {aka AIS, AR8, DHTR, HPCX3, HUMARA, HYSP1}, DGKB (diacylglycerol kinase beta) [NCBI Gene 1607] {aka DAGK2, DGK, DGK-BETA}, SRS [NCBI Gene 140821], TMPRSS2 (transmembrane serine protease 2) [NCBI Gene 7113] {aka PRSS10}, ERG (ETS transcription factor ERG) [NCBI Gene 2078] {aka LMPHM14, erg-3, p55}
- **Diseases:** cancer (MESH:D009369), tumorigenesis (MESH:D063646), SV (MESH:D020914), LRS (MESH:D004410), prostate cancer (MESH:D011471)
- **Chemicals:** LRS (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** LNCaP — Homo sapiens (Human), Prostate carcinoma, Cancer cell line (CVCL_0395)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12883271/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12883271/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/PMC12883271/full.md

---
Source: https://tomesphere.com/paper/PMC12883271