Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion
Yutong Hu, Yang Tan, Andi Han, Lirong Zheng, Liang Hong, Bingxin Zhou

TL;DR
This paper introduces CPDiffusion-SS, a novel latent graph diffusion model that generates diverse protein sequences guided by secondary structural information, improving the reliability and biological relevance of de novo protein design.
Contribution
The paper presents a new diffusion-based model that incorporates secondary structure guidance for more flexible and structurally consistent protein sequence generation.
Findings
Outperforms baseline methods on open benchmarks
Produces highly diverse and novel protein sequences
Case studies confirm biological relevance
Abstract
The advent of deep learning has introduced efficient approaches for de novo protein sequence design, significantly improving success rates and reducing development costs compared to computational or experimental methods. However, existing methods face challenges in generating proteins with diverse lengths and shapes while maintaining key structural features. To address these challenges, we introduce CPDiffusion-SS, a latent graph diffusion model that generates protein sequences based on coarse-grained secondary structural information. CPDiffusion-SS offers greater flexibility in producing a variety of novel amino acid sequences while preserving overall structural constraints, thus enhancing the reliability and diversity of generated proteins. Experimental analyses demonstrate the significant superiority of the proposed method in producing diverse and novel sequences, with CPDiffusion-SS…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetics, Bioinformatics, and Biomedical Research · Machine Learning in Bioinformatics · Bioinformatics and Genomic Networks
MethodsDiffusion
