Graph-Theoretic Models for the Prediction of Molecular Measurements
Anna Niane, Prudence Djagba

TL;DR
This paper evaluates and enhances a graph-theoretic model for molecular property prediction across multiple datasets, demonstrating significant improvements and competitiveness with deep learning methods while maintaining low computational costs.
Contribution
It systematically improves a classical graph-theoretic model with various techniques, achieving state-of-the-art results comparable to deep learning approaches.
Findings
Enhanced models achieve average R^2 of 0.79, a 165-274% improvement.
Classical models match or outperform GCNs on all datasets.
The framework is resource-efficient, training in under five minutes without GPUs.
Abstract
Graph-theoretic approaches offer simplicity, interpretability, and low computational cost for molecular property prediction. Among these, the model proposed by Mukwembi and Nyabadza, based on the external activity and internal activity indices, achieved strong results on a small flavonoid dataset. However, its ability to generalize to larger and chemically diverse datasets has not been tested. This study evaluates the baseline - polynomial model on five benchmark datasets from MoleculeNet, covering biological activity (BACE, 1,513 molecules), lipophilicity (LogP synthetic, 14,610 molecules; LogP experimental, 753 molecules), aqueous solubility (ESOL, 1,128 molecules), and hydration free energy (SAMPL, 642 molecules). The baseline model achieves an average , confirming limited transferability. To address this, a systematic enhancement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
