Target specific peptide design using latent space approximate trajectory collector
Tong Lin, Sijie Chen, Ruchira Basu, Dehu Pei, Xiaolin Cheng, Levent, Burak Kara

TL;DR
This paper introduces LSATC, a novel machine learning approach for peptide design targeting specific proteins, overcoming data scarcity by sampling in a latent space to generate peptides with improved binding properties.
Contribution
The paper presents LSATC, a new end-to-end peptide design architecture that uses latent space trajectory sampling to target specific proteins, demonstrating superior performance over existing models.
Findings
LSATC samples peptides with 36% lower binding scores.
LSATC produces peptides with 284% less hydrophobicity.
All experimentally validated peptides showed at least 20% improved binding.
Abstract
Despite the prevalence and many successes of deep learning applications in de novo molecular design, the problem of peptide generation targeting specific proteins remains unsolved. A main barrier for this is the scarcity of the high-quality training data. To tackle the issue, we propose a novel machine learning based peptide design architecture, called Latent Space Approximate Trajectory Collector (LSATC). It consists of a series of samplers on an optimization trajectory on a highly non-convex energy landscape that approximates the distributions of peptides with desired properties in a latent space. The process involves little human intervention and can be implemented in an end-to-end manner. We demonstrate the model by the design of peptide extensions targeting Beta-catenin, a key nuclear effector protein involved in canonical Wnt signalling. When compared with a random sampler, LSATC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAntimicrobial Peptides and Activities · Chemical Synthesis and Analysis · Machine Learning in Bioinformatics
MethodsBalanced Selection
