Near-Lossless Model Compression Enables Longer Context Inference in DNA Large Language Models
Rui Zhu, Xiaopu Zhou, Haixu Tang, Stephen W. Scherer, Lucila Ohno-Machado

TL;DR
This paper introduces FOCUS, a novel context-compression method for DNA large language models that drastically reduces memory and computational costs, enabling near-lossless inference over much longer genomic sequences.
Contribution
FOCUS is a hierarchical, learnable compression module that integrates with pretrained DNA LLMs to enable ultra-long sequence inference with minimal fidelity loss.
Findings
FOCUS achieves about 100x compression with near-lossless fidelity.
It reduces KV-cache memory and scales inference from quadratic to near-linear complexity.
Enables 100x longer inference windows on standard GPUs.
Abstract
Trained on massive cross-species DNA corpora, DNA large language models (LLMs) learn the fundamental "grammar" and evolutionary patterns of genomic sequences. This makes them powerful priors for DNA sequence modeling, particularly over long ranges. However, two major constraints hinder their use in practice: the quadratic computational cost of self-attention and the growing memory required for key-value (KV) caches during autoregressive decoding. These constraints force the use of heuristics such as fixed-window truncation or sliding windows, which compromise fidelity on ultra-long sequences by discarding distant information. We introduce FOCUS (Feature-Oriented Compression for Ultra-long Self-attention), a progressive context-compression module that can be plugged into pretrained DNA LLMs. FOCUS combines the established k-mer representation in genomics with learnable hierarchical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Chromatin Dynamics · Language and cultural evolution · Natural Language Processing Techniques
