# CADBURE: A generic tool to evaluate the performance of spliced aligners on RNA-Seq data

**Authors:** Praveen Kumar Raj Kumar, Thanh V. Hoang, Michael L. Robinson, Panagiotis A. Tsonis, Chun Liang

PMC · DOI: 10.1038/srep13443 · Scientific Reports · 2015-08-25

## TL;DR

CADBURE is a tool that helps choose the best RNA-Seq alignment method by evaluating uniquely aligned reads, improving gene expression analysis accuracy.

## Contribution

CADBURE introduces a novel method to compare RNA-Seq aligners using real data without simulation, reducing false positives in gene expression analysis.

## Key findings

- Using CADBURE can change the number of differentially expressed genes by up to 10%.
- CADBURE reduces false positives in differential gene expression analysis.
- Eighteen genes showed validated differential expression via RT-qPCR.

## Abstract

The fundamental task in RNA-Seq-based transcriptome analysis is alignment of millions of short reads to the reference genome or transcriptome. Choosing the right tool for the dataset in hand from many existent RNA-Seq alignment packages remains a critical challenge for downstream analysis. To facilitate this choice, we designed a novel tool for comparing alignment results of user data based on the relative reliability of uniquely aligned reads (CADBURE). CADBURE can easily evaluate different aligners, or different parameter sets using the same aligner, and selects the best alignment result for any RNA-Seq dataset. Strengths of CADBURE include the ability to compare alignment results without the need for synthetic data such as simulated genomes, alignment regeneration and randomly subsampled datasets. The benefit of a CADBURE selected alignment result was supported by differentially expressed gene (DEG) analysis. We demonstrated that the use of CADBURE to select the best alignment from a number of different alignment results could change the number of DEGs by as much as 10%. In particular, the CADBURE selected alignment result favors fewer false positives in the DEG analysis. We also verified differential expression of eighteen genes with RT-qPCR validation experiments. CADBURE is an open source tool (http://cadbure.sourceforge.net/).

## Full-text entities

- **Genes:** CLC [NCBI Gene 12735], Rpl12 (ribosomal protein L12) [NCBI Gene 269261] {aka E430018F03}
- **Chemicals:** BAM (-), poly (A/T) (MESH:C008950)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232), FVB/N — Mus musculus (Mouse), Transformed cell line (CVCL_C0MX)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC4548254/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC4548254/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC4548254/full.md

---
Source: https://tomesphere.com/paper/PMC4548254