Bi-TEAM: A Unified Cross-Scale Representation Learning Framework for Chemically Modified Biomolecules
Chunbin Gu, Zijun Gao, Mutian He, Jingjie Zhang, Haipeng Wen, Zihao Luo, Xiaorui Wang, Hanqun Cao, Jiajun Bu, Chang-Yu Hsieh, Pheng Ann Heng

TL;DR
Bi-TEAM is a unified framework that combines protein and chemical language models to improve representation learning for chemically modified biomolecules, enhancing accuracy in biochemical predictions and peptide design.
Contribution
It introduces a novel bi-gated residual fusion mechanism that integrates chemical details into global protein contexts without expanding the alphabet.
Findings
Achieved up to 66% improvement in MCC on scaffold-similarity splits.
Increased hemolysis prediction accuracy by 350%.
Quadrupled success rate in designing cell-penetrating cyclic peptides.
Abstract
Representation learning for protein biochemical space faces a difficult trade-off: protein language models excel at capturing long-range biological semantics but often miss fine-grained chemical details. Conversely, chemical language models encode atomic information but lack broader sequence context. To address this, we introduce Bi-TEAM (Bi-gated Residual Space Modification), a general framework that injects localized chemical variation into global protein contexts. By ensuring robustness against perturbations such as non-canonical amino acids, post-translational modifications (PTMs), and topological constraints, Bi-TEAM uncovers functional chemical dependencies often missed by evolutionary baselines. Mechanistically, Bi-TEAM maps non-canonical residues to their natural counterparts and injects atomic-level data via a bi-gated residual fusion mechanism. Crucially, this process uses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Machine Learning in Materials Science · Protein Structure and Dynamics
