TopP-S: Persistent homology based multi-task deep neural networks for simultaneous predictions of partition coefficient and aqueous solubility
Kedi Wu, Zhixiong Zhao, Renxiao Wang, Guo-Wei Wei

TL;DR
This paper introduces a novel topological representation called element specific persistent homology (ESPH) for small molecules, and employs multi-task deep neural networks to simultaneously predict aqueous solubility and partition coefficient with high accuracy.
Contribution
It presents a new algebraic topology-based molecular descriptor (ESPH) and a multi-task deep learning framework for improved simultaneous property prediction.
Findings
Achieved highly accurate predictions on six datasets.
Demonstrated the effectiveness of topological descriptors in machine learning.
Validated the approach's scalability and robustness.
Abstract
Aqueous solubility and partition coefficient are important physical properties of small molecules. Accurate theoretical prediction of aqueous solubility and partition coefficient plays an important role in drug design and discovery. The prediction accuracy depends crucially on molecular descriptors which are typically derived from theoretical understanding of the chemistry and physics of small molecules. The present work introduces an algebraic topology based method, called element specific persistent homology (ESPH), as a new representation of small molecules that is entirely different from conventional chemical and/or physical representations. ESPH describes molecular properties in terms of multiscale and multicomponent topological invariants. Such topological representation is systematical, comprehensive, and scalable with respect to molecular size and composition variations.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Computational Drug Discovery Methods · Bioinformatics and Genomic Networks
