AMix-1: A Pathway to Test-Time Scalable Protein Foundation Model

Changze Lv; Jiang Zhou; Siyu Long; Lihao Wang; Jiangtao Feng; Dongyu Xue; Yu Pei; Hao Wang; Zherui Zhang; Yuchen Cai; Zhiqiang Gao; Ziyuan Ma; Jiakai Hu; Chaochen Gao; Jingjing Gong; Yuxuan Song; Shuyi Zhang; Xiaoqing Zheng; Deyi Xiong; Lei Bai; Wanli Ouyang; Ya-Qin Zhang; Wei-Ying Ma; Bowen Zhou; Hao Zhou

arXiv:2507.08920 · q-bio.BM · August 12, 2025

TL;DR

AMix-1 is a protein foundation model built on Bayesian Flow Networks. It combines MSA-based in-context learning with a test-time scaling algorithm for protein design and engineering, yielding an AmeR variant with up to 50× higher activity and performance gains that scale with test-time algorithms.

Contribution

We present AMix-1, a novel protein foundation model with a systematic training methodology, an in-context learning framework, and a test-time scaling algorithm for scalable protein engineering.

Findings

01. Trained a 1.7-billion-parameter model that shows structural understanding.

02. Designed an AmeR variant with up to a 50× increase in activity.

03. Demonstrated scalable performance gains from test-time scaling algorithms.

Abstract

We introduce AMix-1, a powerful protein foundation model built on Bayesian Flow Networks and empowered by a systematic training methodology encompassing pretraining scaling laws, emergent capability analysis, an in-context learning mechanism, and a test-time scaling algorithm. To guarantee robust scalability, we establish a predictive scaling law and reveal the progressive emergence of structural understanding from a loss perspective, culminating in a strong 1.7-billion-parameter model. Building on this foundation, we devise a multiple sequence alignment (MSA)-based in-context learning strategy to unify protein design into a general framework, where AMix-1 recognizes deep evolutionary signals among MSAs and consistently generates structurally and functionally coherent proteins. This framework enables the successful design of a dramatically improved AmeR variant with an up to $50 \times$ activity increase…
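To make the phrase "evolutionary signals among MSAs" concrete, the sketch below computes a per-column residue frequency profile, one common way to summarize an MSA. It is an illustration only: the function name `msa_profile` and the toy alignment are made up here, and this listing does not state whether AMix-1 consumes this exact representation.

```python
from collections import Counter

GAP = "-"  # assumed gap symbol; alignment formats differ


def msa_profile(msa):
    """Return per-column residue frequencies for aligned sequences.

    Gaps are ignored when counting; a column that holds only gaps
    yields an empty dict.
    """
    if not msa:
        raise ValueError("empty MSA")
    length = len(msa[0])
    if any(len(seq) != length for seq in msa):
        raise ValueError("aligned sequences must share one length")

    profile = []
    for column in zip(*msa):  # iterate over alignment columns
        counts = Counter(res for res in column if res != GAP)
        total = sum(counts.values())
        freqs = {res: n / total for res, n in counts.items()} if total else {}
        profile.append(freqs)
    return profile


# Toy alignment, invented for illustration (not real AmeR homologs).
toy_msa = ["MKV-", "MRVA", "MKIA"]
profile = msa_profile(toy_msa)
```

Fully conserved columns collapse to a single residue with frequency 1.0, while variable columns spread their mass over several residues; that contrast is the kind of signal an MSA-conditioned model can exploit.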

Code & Models

Models

🤗 GenSI/AMix-1-1.7B (model)
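A minimal sketch of fetching the listed checkpoint, assuming the repository is publicly readable on the Hugging Face Hub and that the `huggingface_hub` package is installed. The helper names `repo_url` and `download_checkpoint` are hypothetical, and the sketch only downloads files; how to load the weights is not described in this listing.

```python
REPO_ID = "GenSI/AMix-1-1.7B"  # repository id as shown in the listing


def repo_url(repo_id=REPO_ID):
    """Build the Hub page URL for a model repository id."""
    return f"https://huggingface.co/{repo_id}"


def download_checkpoint(repo_id=REPO_ID, local_dir=None):
    """Download every file in the repository and return the local path.

    The import is deferred so this module can be imported without
    huggingface_hub installed; calling the function needs the package
    and network access.
    """
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, local_dir=local_dir)
```

Calling `download_checkpoint()` with no arguments stores the files in the default Hub cache; pass `local_dir` to place them in a specific folder.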
