A statistical normalization method and differential expression analysis for RNA-seq data between different species
Yan Zhou, Jiadi Zhu, Tiejun Tong, Junhui Wang, Bingqing Lin, Jun, Zhang

TL;DR
This paper introduces a novel normalization method for RNA-seq data across different species, improving the accuracy of differential gene expression analysis by accounting for conserved orthologous genes and technical variations.
Contribution
The paper presents the SCBN normalization method that incorporates conserved orthologous gene knowledge and hypothesis testing to enhance cross-species RNA-seq analysis.
Findings
SCBN outperforms existing normalization methods in simulations.
The method effectively accounts for gene length and unmapped genes.
Application to real data confirms improved differential expression detection.
Abstract
Background: High-throughput techniques bring novel tools but also statistical challenges to genomic research. Identifying genes with differential expression between different species is an effective way to discover evolutionarily conserved transcriptional responses. To remove systematic variation between different species for a fair comparison, the normalization procedure serves as a crucial pre-processing step that adjusts for the varying sample sequencing depths and other confounding technical effects. Results: In this paper, we propose a scale based normalization (SCBN) method by taking into account the available knowledge of conserved orthologous genes and hypothesis testing framework. Considering the different gene lengths and unmapped genes between different species, we formulate the problem from the perspective of hypothesis testing and search for the optimal scaling factor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Molecular Biology Techniques and Applications · Genomics and Phylogenetic Studies
