Graph-Theoretic Models for the Prediction of Molecular Measurements

Anna Niane; Prudence Djagba

arXiv:2604.19840·cs.LG·April 23, 2026

Graph-Theoretic Models for the Prediction of Molecular Measurements

Anna Niane, Prudence Djagba

PDF

TL;DR

This paper evaluates and enhances a graph-theoretic model for molecular property prediction across multiple datasets, demonstrating significant improvements and competitiveness with deep learning methods while maintaining low computational costs.

Contribution

It systematically improves a classical graph-theoretic model with various techniques, achieving state-of-the-art results comparable to deep learning approaches.

Findings

01

Enhanced models achieve average R^2 of 0.79, a 165-274% improvement.

02

Classical models match or outperform GCNs on all datasets.

03

The framework is resource-efficient, training in under five minutes without GPUs.

Abstract

Graph-theoretic approaches offer simplicity, interpretability, and low computational cost for molecular property prediction. Among these, the model proposed by Mukwembi and Nyabadza, based on the external activity $D (G)$ and internal activity $ζ (G)$ indices, achieved strong results on a small flavonoid dataset. However, its ability to generalize to larger and chemically diverse datasets has not been tested. This study evaluates the baseline $D (G)$ - $ζ (G)$ polynomial model on five benchmark datasets from MoleculeNet, covering biological activity (BACE, 1,513 molecules), lipophilicity (LogP synthetic, 14,610 molecules; LogP experimental, 753 molecules), aqueous solubility (ESOL, 1,128 molecules), and hydration free energy (SAMPL, 642 molecules). The baseline model achieves an average $R^{2} = 0.24$ , confirming limited transferability. To address this, a systematic enhancement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.