HybriDNA: A Hybrid Transformer-Mamba2 Long-Range DNA Language Model
Mingqian Ma, Guoqing Liu, Chuan Cao, Pan Deng, Tri Dao, Albert Gu, Peiran Jin, Zhao Yang, Yingce Xia, Renqian Luo, Pipi Hu, Zun Wang, Yuan-Jyue Chen, Haiguang Liu, Tao Qin

TL;DR
HybriDNA is a novel hybrid Transformer-Mamba2 DNA language model capable of processing ultra-long DNA sequences with single-nucleotide resolution, achieving state-of-the-art results in understanding and generating DNA data across various benchmarks.
Contribution
This paper introduces HybriDNA, a hybrid Transformer-Mamba2 architecture that efficiently models ultra-long DNA sequences and improves performance on multiple DNA understanding and generation tasks.
Findings
HybriDNA processes sequences of up to 131 kb at single-nucleotide resolution with high accuracy.
It achieves state-of-the-art results on 33 DNA datasets.
Performance improves as the model scales from 300M to 7B parameters.
Abstract
Advances in natural language processing and large language models have sparked growing interest in modeling DNA, often referred to as the "language of life". However, DNA modeling poses unique challenges. First, it requires the ability to process ultra-long DNA sequences while preserving single-nucleotide resolution, as individual nucleotides play a critical role in DNA function. Second, success in this domain requires excelling at both generative and understanding tasks: generative tasks hold potential for therapeutic and industrial applications, while understanding tasks provide crucial insights into biological mechanisms and diseases. To address these challenges, we propose HybriDNA, a decoder-only DNA language model that incorporates a hybrid Transformer-Mamba2 architecture, seamlessly integrating the strengths of attention mechanisms with selective state-space models. This hybrid…
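To make "single-nucleotide resolution" concrete, the following is a minimal sketch of per-base tokenization, in which every nucleotide becomes exactly one token, so a single-base substitution changes exactly one token ID. The vocabulary, the `tokenize` and `apply_snv` helpers, and the example sequence are all hypothetical illustrations; they are not taken from the HybriDNA paper or its code.

```python
# Hypothetical vocabulary for illustration only; the actual HybriDNA
# tokenizer and its token IDs are not described in this summary.
VOCAB = {"A": 0, "C": 1, "G": 2, "T": 3, "N": 4}


def tokenize(sequence: str) -> list[int]:
    """Map each nucleotide to one integer ID; unknown symbols map to N."""
    return [VOCAB.get(base, VOCAB["N"]) for base in sequence.upper()]


def apply_snv(sequence: str, position: int, alt: str) -> str:
    """Return a copy of the sequence with the base at `position` replaced."""
    return sequence[:position] + alt + sequence[position + 1:]


reference = "ACGTTGCA"
variant = apply_snv(reference, 3, "G")  # substitute the base at index 3

ref_ids = tokenize(reference)
var_ids = tokenize(variant)

# With one token per base, the token count equals the sequence length and
# the substitution shows up at exactly one token position.
changed = [i for i, (r, v) in enumerate(zip(ref_ids, var_ids)) if r != v]
print(len(ref_ids), changed)
```

Under this scheme the token count grows linearly with sequence length, which is why long inputs call for an architecture that handles long contexts efficiently.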
Taxonomy
Topics: Environmental DNA in Biodiversity Studies
Methods: Softmax · Attention Is All You Need
