Performance Evaluation of Transcriptomics Data Normalization for Survival Risk Prediction
Ai Ni, Li-Xuan Qin

TL;DR
This study evaluates how different normalization methods affect survival risk prediction from transcriptomics data, revealing that median and variance stabilizing normalization outperform the commonly used quantile normalization.
Contribution
The paper introduces a benchmarking tool for assessing normalization methods specifically in survival prediction, highlighting the importance of normalization choice in downstream analysis.
Findings
Handling effects significantly impact survival prediction accuracy.
Quantile normalization underperforms compared to median and variance stabilizing normalization.
Normalization evaluation should consider the specific downstream analysis context.
Abstract
One pivotal feature of transcriptomics data is the unwanted variations caused by disparate experimental handling, known as handling effects. Various data normalization methods were developed to alleviate the adverse impact of handling effects in the setting of differential expression analysis. However, little research has been done to evaluate their performance in the setting of survival outcome prediction, an important analysis goal for transcriptomics data in biomedical research. Leveraging a unique pair of datasets for the same set of tumor samples-one with handling effects and the other without, we developed a benchmarking tool for conducting such an evaluation in microRNA microarrays. We applied this tool to evaluate the performance of three popular normalization methods-quantile normalization, median normalization, and variance stabilizing normalization-in survival prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMicroRNA in disease regulation · Gene expression and cancer classification · Cancer-related molecular mechanisms research
