Causal Inference for Genomic Data with Multiple Heterogeneous Outcomes
Jin-Hong Du, Zhenghao Zeng, Edward H. Kennedy, Larry, Wasserman, Kathryn Roeder

TL;DR
This paper introduces a semiparametric framework for causal inference in genomics using multiple derived outcomes from single-cell RNA sequencing, enabling robust estimation and multiple testing control.
Contribution
It develops a doubly robust inference method for multiple heterogeneous outcomes, including standardized and quantile treatment effects, with tailored multiple testing procedures.
Findings
Demonstrated utility in single-cell CRISPR perturbation analysis
Provided insights into causal estimands in genomics
Controlled false discovery rate in multiple testing
Abstract
With the evolution of single-cell RNA sequencing techniques into a standard approach in genomics, it has become possible to conduct cohort-level causal inferences based on single-cell-level measurements. However, the individual gene expression levels of interest are not directly observable; instead, only repeated proxy measurements from each individual's cells are available, providing a derived outcome to estimate the underlying outcome for each of many genes. In this paper, we propose a generic semiparametric inference framework for doubly robust estimation with multiple derived outcomes, which also encompasses the usual setting of multiple outcomes when the response of each unit is available. To reliably quantify the causal effects of heterogeneous outcomes, we specialize the analysis to standardized average treatment effects and quantile treatment effects. Through this, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology
MethodsCausal inference
