TriFit: Trimodal Fusion with Protein Dynamics for Mutation Fitness Prediction
Seungik Cho

TL;DR
TriFit is a multimodal framework that integrates sequence, structure, and protein dynamics to improve mutation fitness prediction, outperforming existing models on benchmark datasets.
Contribution
It introduces a novel Mixture-of-Experts fusion module that adaptively combines modalities, incorporating protein dynamics into mutation effect prediction.
Findings
TriFit achieves AUROC 0.897 on ProteinGym benchmark.
Dynamics modality provides the largest contribution to performance.
TriFit produces well-calibrated probabilistic outputs.
Abstract
Predicting the functional impact of single amino acid substitutions (SAVs) is central to understanding genetic disease and engineering therapeutic proteins. While protein language models and structure-based methods have achieved strong performance on this task, they systematically neglect protein dynamics; residue flexibility, correlated motions, and allosteric coupling are well-established determinants of mutational tolerance in structural biology, yet have not been incorporated into supervised variant effect predictors. We present TriFit, a multimodal framework that integrates sequence, structure, and protein dynamics through a four-expert Mixture-of-Experts (MoE) fusion module with trimodal cross-modal contrastive learning. Sequence embeddings are extracted via masked marginal scoring with ESM-2 (650M); structural embeddings from AlphaFold2-predicted C-alpha geometries; and dynamics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
