Rank-and-Reason: Multi-Agent Collaboration Accelerates Zero-Shot Protein Mutation Prediction
Yang Tan, Yuanxi Yu, Can Wu, Bozitao Zhong, Mingchen Li, Guisheng Fan, Jiankang Zhu, Yafeng Liang, Nanqing Dong, Liang Hong

TL;DR
This paper introduces Rank-and-Reason, a multi-agent framework that automates protein mutation prediction and validation, significantly improving correlation and hit rates, and successfully identifying novel mutants with enhanced activity.
Contribution
The paper presents a novel two-stage agentic framework that automates protein mutation candidate ranking and structural reasoning, outperforming existing methods in zero-shot protein engineering tasks.
Findings
Achieved a Spearman correlation of 0.551 on ProteinGym, surpassing previous 0.518.
Improved Top-5 Hit Rate by up to 367% on ProteinGym-DMS99.
Validated effectiveness through wet-lab experiments, discovering mutants with 4.23- and 5.05-fold activity increases.
Abstract
Zero-shot mutation prediction is vital for low-resource protein engineering, yet existing protein language models (PLMs) often yield statistically confident results that ignore fundamental biophysical constraints. Currently, selecting candidates for wet-lab validation relies on manual expert auditing of PLM outputs, a process that is inefficient, subjective, and highly dependent on domain expertise. To address this, we propose Rank-and-Reason (VenusRAR), a two-stage agentic framework to automate this workflow and maximize expected wet-lab fitness. In the Rank-Stage, a Computational Expert and Virtual Biologist aggregate a context-aware multi-modal ensemble, establishing a new Spearman correlation record of 0.551 (vs. 0.518) on ProteinGym. In the Reason-Stage, an agentic Expert Panel employs chain-of-thought reasoning to audit candidates against geometric and structural constraints,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Genomics and Rare Diseases · vaccines and immunoinformatics approaches
