Context-Aware Regularization with Markovian Integration for Attention-Based Nucleotide Analysis
Mohammadsaleh Refahi, Mahdi Abavisani, Bahrad A. Sokhansanj, James R. Brown, and Gail Rosen

TL;DR
This paper introduces CARMANIA, a novel self-supervised framework for nucleotide sequence analysis that integrates Markovian transition regularization into attention models, significantly improving long-range dependency capture and task performance.
Contribution
CARMANIA uniquely combines transition-matrix loss with attention mechanisms to explicitly model higher-order dependencies in genomic sequences, enhancing accuracy and efficiency.
Findings
Outperforms previous models by at least 7% on diverse tasks.
Matches or exceeds state-of-the-art on shorter sequences.
Achieves up to 34% MCC improvement in enhancer prediction.
Abstract
Transformers have revolutionized nucleotide sequence analysis, yet capturing long-range dependencies remains challenging. Recent studies show that autoregressive transformers often exhibit Markovian behavior by relying on fixed-length context windows for next-token prediction. However, standard self-attention mechanisms are computationally inefficient for long sequences due to their quadratic complexity and do not explicitly enforce global transition consistency. We introduce CARMANIA (Context-Aware Regularization with Markovian Integration for Attention-Based Nucleotide Analysis), a self-supervised pretraining framework that augments next-token (NT) prediction with a transition-matrix (TM) loss. The TM loss aligns predicted token transitions with empirically derived n-gram statistics from each input sequence, encouraging the model to capture higher-order dependencies beyond local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGene expression and cancer classification · Machine Learning in Bioinformatics · Cell Image Analysis Techniques
