LightCPPgen: An Explainable Machine Learning Pipeline for Rational Design of Cell Penetrating Peptides
Gabriele Maroni, Filip Stojceski, Lorenzo Pallante, Marco A. Deriu,, Dario Piga, Gianvito Grasso

TL;DR
LightCPPgen is an innovative machine learning and optimization framework that enables the rational, explainable, and efficient de novo design of cell-penetrating peptides, reducing experimental costs and time.
Contribution
The paper introduces LightCPPgen, a novel approach combining LightGBM and genetic algorithms for explainable and efficient CPP design, advancing the rational development of therapeutic peptides.
Findings
Developed an accurate, interpretable predictive model with 20 explainable features.
Integrated ML with optimization to generate high-penetrability CPP candidates.
Significantly reduced experimental efforts by prioritizing promising peptide sequences.
Abstract
Cell-penetrating peptides (CPPs) are powerful vectors for the intracellular delivery of a diverse array of therapeutic molecules. Despite their potential, the rational design of CPPs remains a challenging task that often requires extensive experimental efforts and iterations. In this study, we introduce an innovative approach for the de novo design of CPPs, leveraging the strengths of machine learning (ML) and optimization algorithms. Our strategy, named LightCPPgen, integrates a LightGBM-based predictive model with a genetic algorithm (GA), enabling the systematic generation and optimization of CPP sequences. At the core of our methodology is the development of an accurate, efficient, and interpretable predictive model, which utilizes 20 explainable features to shed light on the critical factors influencing CPP translocation capacity. The CPP predictive model works synergistically with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA Interference and Gene Delivery · Advanced biosensing and bioanalysis techniques
MethodsGenetic Algorithms
