BatMan: Mitigating Batch Effects via Stratification for Survival Outcome Prediction
Ai Ni, Mengling Liu, Li-Xuan Qin

TL;DR
This paper introduces BatMan, a new method for mitigating batch effects in survival outcome prediction from transcriptomics data, outperforming existing methods like ComBat especially when data normalization is also applied.
Contribution
The paper proposes BatMan, a novel stratification-based approach for batch effect correction in survival analysis, addressing limitations of existing methods like ComBat.
Findings
BatMan outperforms ComBat in most simulated scenarios with batch effects.
Adding data normalization can worsen prediction performance.
BatMan demonstrates superior performance on ovarian cancer microRNA data.
Abstract
Reproducible translation of transcriptomics data has been hampered by the ubiquitous presence of batch effects. Statistical methods for managing batch effects were initially developed in the setting of sample group comparison and later borrowed for other settings such as survival outcome prediction. The most notable such method is ComBat, which adjusts for batches by including it as a covariate alongside sample groups in a linear regression. In survival prediction, however, ComBat is used without definable groups for survival outcome and is done sequentially with survival regression for a potentially confounded outcome. To address these issues, we propose a new method, called BatMan ("BATch MitigAtion via stratificatioN"). It adjusts batches as strata in survival regression and utilize variable selection methods such as LASSO to handle high dimensionality. We assess the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCancer-related molecular mechanisms research · Gene expression and cancer classification · Molecular Biology Techniques and Applications
