OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models
Heng Yang, Jack Cole, Ke Li

TL;DR
OmniGenBench is a comprehensive, open-source benchmarking framework that standardizes and automates the evaluation of genomic foundation models across diverse tasks, accelerating research and application in genomics.
Contribution
It introduces GFMBench, a standardized, automated benchmarking platform for GFM evaluation, with a public leaderboard and user-friendly tools to foster progress in genomic modeling.
Findings
Integrated millions of genomic sequences across tasks
Standardized benchmark suites for GFM evaluation
Launched a public leaderboard for GFM performance
Abstract
The advancements in artificial intelligence in recent years, such as Large Language Models (LLMs), have fueled expectations for breakthroughs in genomic foundation models (GFMs). The code of nature, hidden in diverse genomes since the very beginning of life's evolution, holds immense potential for impacting humans and ecosystems through genome modeling. Recent breakthroughs in GFMs, such as Evo, have attracted significant investment and attention to genomic modeling, as they address long-standing challenges and transform in-silico genomic studies into automated, reliable, and efficient paradigms. In the context of this flourishing era of consecutive technological revolutions in genomics, GFM studies face two major challenges: the lack of GFM benchmarking tools and the absence of open-source software for diverse genomics. These challenges hinder the rapid evolution of GFMs and their wide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · Genomics and Phylogenetic Studies
MethodsSoftmax · Attention Is All You Need
