A Standardized Framework For Evaluating Gene Expression Generative Models
Andrea Rubbi, Andrea Giuseppe Di Francesco, Mohammad Lotfollahi, Pietro Li\`o

TL;DR
This paper introduces GGE, an open-source framework that standardizes evaluation of gene expression generative models, addressing inconsistencies and enabling fair benchmarking with biologically relevant metrics.
Contribution
The paper presents GGE, a comprehensive, standardized evaluation framework for gene expression models, incorporating distributional and biological metrics for reproducible benchmarking.
Findings
Metrics vary greatly with implementation choices
Standardization improves comparability of models
GGE accelerates progress in gene expression modeling
Abstract
The rapid development of generative models for single-cell gene expression data has created an urgent need for standardised evaluation frameworks. Current evaluation practices suffer from inconsistent metric implementations, incomparable hyperparameter choices, and a lack of biologically-grounded metrics. We present Generated Genetic Expression Evaluator (GGE), an open-source Python framework that addresses these challenges by providing a comprehensive suite of distributional metrics with explicit computation space options and biologically-motivated evaluation through differentially expressed gene (DEG)-focused analysis and perturbation-effect correlation, enabling standardized reporting and reproducible benchmarking. Through extensive analysis of the single-cell generative modeling literature, we identify that no standardized evaluation protocol exists. Methods report incomparable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSingle-cell and spatial transcriptomics · Gene expression and cancer classification · Bioinformatics and Genomic Networks
