Decoding Translation-Related Functional Sequences in 5'UTRs Using Interpretable Deep Learning Models

Yuxi Lin; Yaxue Fang; Zehong Zhang; Zhouwu Liu; Siyun Zhong; Zhongfang Wang; Fulong Yu

arXiv:2507.16801·q-bio.QM·February 27, 2026

Decoding Translation-Related Functional Sequences in 5'UTRs Using Interpretable Deep Learning Models

Yuxi Lin, Yaxue Fang, Zehong Zhang, Zhouwu Liu, Siyun Zhong, Zhongfang Wang, Fulong Yu

PDF

Open Access

TL;DR

This paper presents UTR-STCNet, an interpretable deep learning model that accurately predicts translational efficiency from variable-length 5'UTRs and uncovers functional regulatory elements using saliency-guided clustering and attention mechanisms.

Contribution

Introduces UTR-STCNet, a novel Transformer-based architecture with saliency-aware clustering for flexible, interpretable modeling of 5'UTRs without input truncation or added computational cost.

Findings

01

Outperforms state-of-the-art models in predicting translational efficiency.

02

Recovers known regulatory elements like upstream AUGs and Kozak motifs.

03

Demonstrates biological interpretability of the model.

Abstract

Understanding how 5' untranslated regions (5'UTRs) regulate mRNA translation is critical for controlling protein expression and designing effective therapeutic mRNAs. While recent deep learning models have shown promise in predicting translational efficiency from 5'UTR sequences, most are constrained by fixed input lengths and limited interpretability. We introduce UTR-STCNet, a Transformer-based architecture for flexible and biologically grounded modeling of variable-length 5'UTRs. UTR-STCNet integrates a Saliency-Aware Token Clustering (SATC) module that iteratively aggregates nucleotide tokens into multi-scale, semantically meaningful units based on saliency scores. A Saliency-Guided Transformer (SGT) block then captures both local and distal regulatory dependencies using a lightweight attention mechanism. This combined architecture achieves efficient and interpretable modeling…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques