Ollivier persistent Ricci curvature (OPRC) based molecular representation for drug design
JunJie Wee, Kelin Xia

TL;DR
This paper introduces Ollivier persistent Ricci curvature (OPRC) as a novel molecular featurization technique for drug design, demonstrating its superiority over traditional descriptors in predicting protein-ligand binding affinity.
Contribution
The paper presents the first application of persistent Ricci curvature, specifically OPRC, for molecular featurization and integrates it with machine learning for improved drug design predictions.
Findings
OPRC outperforms traditional molecular descriptors in binding affinity prediction.
The method effectively captures molecular geometric and topological properties.
Experimental results on PDBbind datasets validate the approach's effectiveness.
Abstract
Efficient molecular featurization is one of the major issues for machine learning models in drug design. Here we propose persistent Ricci curvature (PRC), in particular Ollivier persistent Ricci curvature (OPRC), for the molecular featurization and feature engineering, for the first time. Filtration process proposed in persistent homology is employed to generate a series of nested molecular graphs. Persistence and variation of Ollivier Ricci curvatures on these nested graphs are defined as Ollivier persistent Ricci curvature. Moreover, persistent attributes, which are statistical and combinatorial properties of OPRCs during the filtration process, are used as molecular descriptors, and further combined with machine learning models, in particular, gradient boosting tree (GBT). Our OPRC-GBT model is used in the prediction of protein-ligand binding affinity, which is one of key steps in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Topological and Geometric Data Analysis · Bioinformatics and Genomic Networks
