A Robust Approach for Identifying Gene-Environment Interactions for Prognosis
Hao Chai, Qingzhao Zhang, Yu Jiang, Guohua Wang, Sanguo Zhang and, Shuangge Ma

TL;DR
This paper introduces a robust statistical method for identifying gene-environment interactions affecting prognosis, effectively handling data contamination and outperforming existing methods in simulations and real cancer data analysis.
Contribution
It proposes a novel robust AFT model with exponential squared loss and regularization, improving gene-environment interaction detection under contaminated data conditions.
Findings
Outperforms unrobust methods in contaminated data scenarios.
Identifies novel gene-environment interactions in lung cancer data.
Demonstrates stability and clinical relevance of identified markers.
Abstract
For many complex diseases, prognosis is of essential importance. It has been shown that, beyond the main effects of genetic (G) and environmental (E) risk factors, the gene-environment (GE) interactions also play a critical role. In practice, the prognosis outcome data can be contaminated, and most of the existing methods are not robust to data contamination. In the literature, it has been shown that even a single contaminated observation can lead to severely biased model estimation. In this study, we describe prognosis using an accelerated failure time (AFT) model. An exponential squared loss is proposed to accommodate possible data contamination. A penalization approach is adopted for regularized estimation and marker selection. The proposed method is realized using an effective coordinate descent (CD) and minorization maximization (MM) algorithm. Simulation shows that without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Bioinformatics and Genomic Networks · Genetic Associations and Epidemiology
