Swarms of Large Language Model Agents for Protein Sequence Design with Experimental Validation
Fiona Y. Wang, Di Sheng Lee, David L. Kaplan, Markus J. Buehler

TL;DR
This paper introduces a decentralized swarm intelligence framework using multiple large language model agents for de novo protein design, validated experimentally, offering a flexible, scalable, and efficient alternative to existing methods.
Contribution
The paper presents a novel agent-based, decentralized LLM framework for protein design that operates without fine-tuning, enabling diverse sequence generation with emergent behaviors.
Findings
Achieves efficient, objective-directed protein design within a few GPU-hours.
Operates without fine-tuning or specialized training, demonstrating scalability.
Validated with experiments on alpha helix and coil proteins.
Abstract
Designing proteins de novo with tailored structural, physicochemical, and functional properties remains a grand challenge in biotechnology, medicine, and materials science, due to the vastness of sequence space and the complex coupling between sequence, structure, and function. Current state-of-the-art generative methods, such as protein language models (PLMs) and diffusion-based architectures, often require extensive fine-tuning, task-specific data, or model reconfiguration to support objective-directed design, thereby limiting their flexibility and scalability. To overcome these limitations, we present a decentralized, agent-based framework inspired by swarm intelligence for de novo protein design. In this approach, multiple large language model (LLM) agents operate in parallel, each assigned to a specific residue position. These agents iteratively propose context-aware mutations by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics
