GenePheno: Interpretable Gene Knockout-Induced Phenotype Abnormality Prediction from Gene Sequences
Jingquan Yan, Yuwei Miao, Lei Yu, Yuzhi Guo, Xue Xiao, Lin Xu, Junzhou Huang

TL;DR
GenePheno is an interpretable machine learning framework that predicts phenotypic abnormalities caused by gene knockouts directly from gene sequences, advancing understanding of gene-phenotype relationships.
Contribution
It introduces the first interpretable multi-label prediction model for knockout-induced phenotypes from gene sequences, incorporating a contrastive learning objective and biological regularization.
Findings
Achieves state-of-the-art gene-centric Fmax and phenotype AUC across datasets.
Demonstrates ability to reveal gene functional mechanisms.
Supports scalable, generalizable phenotype prediction from sequences.
Abstract
Exploring how genetic sequences shape phenotypes is a fundamental challenge in biology and a key step toward scalable, hypothesis-driven experimentation. The task is complicated by the large modality gap between sequences and phenotypes, as well as the pleiotropic nature of gene-phenotype relationships. Existing sequence-based efforts focus on the degree to which variants of specific genes alter a limited set of phenotypes, while general gene knockout induced phenotype abnormality prediction methods heavily rely on curated genetic information as inputs, which limits scalability and generalizability. As a result, the task of broadly predicting the presence of multiple phenotype abnormalities under gene knockout directly from gene sequences remains underexplored. We introduce GenePheno, the first interpretable multi-label prediction framework that predicts knockout induced phenotypic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsBioinformatics and Genomic Networks · Biomedical Text Mining and Ontologies · Gene expression and cancer classification
