Active learning for efficient discovery of optimal gene combinations in the combinatorial perturbation space
Jason Qin, Hans-Hermann Wessels, Carlos Fernandez-Granda, Yuhan Hao

TL;DR
This paper presents NAIAD, an active learning framework that efficiently identifies optimal gene combinations in CRISPR screens, significantly reducing experimental efforts and accelerating discovery of effective therapies.
Contribution
NAIAD introduces adaptive gene embeddings and active learning strategies to improve gene combination discovery in large combinatorial spaces.
Findings
NAIAD outperforms existing models by up to 40% on multiple datasets.
It accelerates discovery with fewer CRISPR experiments.
NAIAD effectively captures complex gene interactions as more data is collected.
Abstract
The advancement of novel combinatorial CRISPR screening technologies enables the identification of synergistic gene combinations on a large scale. This is crucial for developing novel and effective combination therapies, but the combinatorial space makes exhaustive experimentation infeasible. We introduce NAIAD, an active learning framework that efficiently discovers optimal gene pairs capable of driving cells toward desired cellular phenotypes. NAIAD leverages single-gene perturbation effects and adaptive gene embeddings that scale with the training data size, mitigating overfitting in small-sample learning while capturing complex gene interactions as more data is collected. Evaluated on four CRISPR combinatorial perturbation datasets totaling over 350,000 genetic interactions, NAIAD, trained on small datasets, outperforms existing models by up to 40\% relative to the second-best.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Gene Regulatory Network Analysis · Gene expression and cancer classification
MethodsLib
