TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction

Seungik Cho

arXiv:2604.12026·cs.LG·April 15, 2026

TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction

Seungik Cho

PDF

TL;DR

TriFit is a multimodal framework that integrates sequence, structure, and protein dynamics to improve mutation fitness prediction, outperforming existing models on benchmark datasets.

Contribution

It introduces a novel Mixture-of-Experts fusion module that adaptively combines modalities, incorporating protein dynamics into mutation effect prediction.

Findings

01

TriFit achieves AUROC 0.897 on ProteinGym benchmark.

02

Dynamics modality provides the largest contribution to performance.

03

TriFit produces well-calibrated probabilistic outputs.

Abstract

Predicting the functional impact of single amino acid substitutions (SAVs) is central to understanding genetic disease and engineering therapeutic proteins. While protein language models and structure-based methods have achieved strong performance on this task, they systematically neglect protein dynamics; residue flexibility, correlated motions, and allosteric coupling are well-established determinants of mutational tolerance in structural biology, yet have not been incorporated into supervised variant effect predictors. We present TriFit, a multimodal framework that integrates sequence, structure, and protein dynamics through a four-expert Mixture-of-Experts (MoE) fusion module with trimodal cross-modal contrastive learning. Sequence embeddings are extracted via masked marginal scoring with ESM-2 (650M); structural embeddings from AlphaFold2-predicted C-alpha geometries; and dynamics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.