Generative Enzyme Design Guided by Functionally Important Sites and Small-Molecule Substrates
Zhenqiao Song, Yunlong Zhao, Wenxian Shi, Wengong Jin, Yang Yang, Lei Li

TL;DR
This paper introduces EnzyGen, a novel deep learning model that designs enzymes by generating amino acid sequences and 3D structures based on functionally important sites and substrates, advancing automated enzyme engineering.
Contribution
EnzyGen is the first unified model capable of designing enzymes across all functional families by integrating sequence, structure, and substrate information with a new attention-based network.
Findings
EnzyGen outperforms baselines with 10.79% higher substrate binding affinity.
The model effectively generates enzymes with high structural fidelity and substrate specificity.
EnzyBench dataset covers all enzyme families in PDB for comprehensive training.
Abstract
Enzymes are genetically encoded biocatalysts capable of accelerating chemical reactions. How can we automatically design functional enzymes? In this paper, we propose EnzyGen, an approach to learn a unified model to design enzymes across all functional families. Our key idea is to generate an enzyme's amino acid sequence and their three-dimensional (3D) coordinates based on functionally important sites and substrates corresponding to a desired catalytic function. These sites are automatically mined from enzyme databases. EnzyGen consists of a novel interleaving network of attention and neighborhood equivariant layers, which captures both long-range correlation in an entire protein sequence and local influence from nearest amino acids in 3D space. To learn the generative model, we devise a joint training objective, including a sequence generation loss, a position prediction loss and an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnzyme Catalysis and Immobilization
