Towards Joint Sequence-Structure Generation of Nucleic Acid and Protein Complexes with SE(3)-Discrete Diffusion
Alex Morehead, Jeffrey Ruffolo, Aadyot Bhatnagar, Ali Madani

TL;DR
MMDiff is a novel generative model that jointly designs sequences and structures of nucleic acid and protein complexes using SE(3)-discrete diffusion, advancing macromolecular design capabilities.
Contribution
It introduces MMDiff, the first model to jointly generate sequences and structures of nucleic acid-protein complexes with SE(3)-discrete diffusion noise.
Findings
Successfully generates micro-RNA and single-stranded DNA molecules.
Modestly capable of modeling DNA and RNA interactions with multi-chain protein complexes.
Provides a new benchmark for macromolecular complex generation.
Abstract
Generative models of macromolecules carry abundant and impactful implications for industrial and biomedical efforts in protein engineering. However, existing methods are currently limited to modeling protein structures or sequences, independently or jointly, without regard to the interactions that commonly occur between proteins and other macromolecules. In this work, we introduce MMDiff, a generative model that jointly designs sequences and structures of nucleic acid and protein complexes, independently or in complex, using joint SE(3)-discrete diffusion noise. Such a model has important implications for emerging areas of macromolecular design including structure-based transcription factor design and design of noncoding RNA sequences. We demonstrate the utility of MMDiff through a rigorous new design benchmark for macromolecular complex generation that we introduce in this work. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · RNA Research and Splicing · Genomics and Chromatin Dynamics
MethodsDiffusion
