Multi-Modal CLIP-Informed Protein Editing
Mingze Yin, Hanjing Zhou, Yiheng Zhu, Miao Lin, Yixuan Wu, Jialu Wu,, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou, Jintai Chen, Jian Wu

TL;DR
ProtET is a novel multi-modal learning method that uses CLIP-inspired contrastive learning to enable controllable, interactive protein editing guided by biotext instructions, significantly improving stability and functionality.
Contribution
The paper introduces ProtET, a two-stage CLIP-informed approach that aligns protein and biotext representations for effective, instruction-guided protein editing, addressing limitations of prior methods.
Findings
ProtET outperforms state-of-the-art methods in protein editing tasks.
ProtET achieves over 16% improvements in stability metrics.
The method effectively enhances enzyme activity, stability, and antibody binding.
Abstract
Proteins govern most biological functions essential for life, but achieving controllable protein discovery and optimization remains challenging. Recently, machine learning-assisted protein editing (MLPE) has shown promise in accelerating optimization cycles and reducing experimental workloads. However, current methods struggle with the vast combinatorial space of potential protein edits and cannot explicitly conduct protein editing using biotext instructions, limiting their interactivity with human feedback. To fill these gaps, we propose a novel method called ProtET for efficient CLIP-informed protein editing through multi-modality learning. Our approach comprises two stages: in the pretraining stage, contrastive learning aligns protein-biotext representations encoded by two large language models (LLMs), respectively. Subsequently, during the protein editing stage, the fused features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Monoclonal and Polyclonal Antibodies Research · Biotin and Related Studies
MethodsContrastive Learning
