G2PDiffusion: Cross-Species Genotype-to-Phenotype Prediction via Evolutionary Diffusion
Mengdi Liu, Zhangyang Gao, Hong Chang, Stan Z. Li, Shiguang Shan,, Xilin Chen

TL;DR
G2PDiffusion is a novel diffusion model that predicts cross-species phenotypes from genotypes by integrating evolutionary signals and environmental contexts, enabling better generalization and understanding of genotype-phenotype relationships.
Contribution
This work introduces the first genotype-to-phenotype diffusion model leveraging evolutionary signals and environmental data for cross-species phenotype prediction.
Findings
Model effectively incorporates evolutionary signals and environment.
Enables cross-species genotype-to-phenotype prediction.
Improves understanding of phenotype variability across species.
Abstract
Understanding how genes influence phenotype across species is a fundamental challenge in genetic engineering, which will facilitate advances in various fields such as crop breeding, conservation biology, and personalized medicine. However, current phenotype prediction models are limited to individual species and expensive phenotype labeling process, making the genotype-to-phenotype prediction a highly domain-dependent and data-scarce problem. To this end, we suggest taking images as morphological proxies, facilitating cross-species generalization through large-scale multimodal pretraining. We propose the first genotype-to-phenotype diffusion model (G2PDiffusion) that generates morphological images from DNA considering two critical evolutionary signals, i.e., multiple sequence alignments (MSA) and environmental contexts. The model contains three novel components: 1) a MSA retrieval…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification
MethodsDiffusion
