# invMap: a sensitive mapping tool for long noisy reads with inversion structural variants

**Authors:** Ze-Gang Wei, Peng-Yu Bu, Xiao-Dan Zhang, Fei Liu, Yu Qian, Fang-Xiang Wu

PMC · DOI: 10.1093/bioinformatics/btad726 · 2023-12-07

## TL;DR

invMap is a new tool for accurately mapping long, noisy DNA sequences to detect inversion structural variants, which are hard to identify with existing methods.

## Contribution

invMap introduces a novel mapping algorithm that improves inversion detection accuracy in long-read sequencing data.

## Key findings

- invMap outperforms existing methods in locating aligned regions and calling inversion structural variants.
- Testing on simulated and real human genome data shows invMap detects more inversion variants than competing tools.
- The method is robust across different genomes and sequencing coverages.

## Abstract

Longer reads produced by PacBio or Oxford Nanopore sequencers could more frequently
span the breakpoints of structural variations (SVs) than shorter reads. Therefore,
existing long-read mapping methods often generate wrong alignments and variant calls.
Compared to deletions and insertions, inversion events are more difficult to be detected
since the anchors in inversion regions are nonlinear to those in SV-free regions. To
address this issue, this study presents a novel long-read mapping algorithm (named as
invMap).

For each long noisy read, invMap first locates the aligned region with a specifically
designed scoring method for chaining, then checks the remaining anchors in the aligned
region to discover potential inversions. We benchmark invMap on simulated datasets
across different genomes and sequencing coverages, experimental results demonstrate that
invMap is more accurate to locate aligned regions and call SVs for inversions than the
competing methods. The real human genome sequencing dataset of NA12878 illustrates that
invMap can effectively find more candidate variant calls for inversions than the
competing methods.

The invMap software is available at https://github.com/zhang134/invMap.git.

## Full-text entities

- **Chemicals:** LAMSA (-)
- **Species:** Homo sapiens (human, species) [taxon 9606], Neisseria meningitidis (species) [taxon 487], Abortiporus biennis (blushing rosette, species) [taxon 137743]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11320709/full.md

---
Source: https://tomesphere.com/paper/PMC11320709