Interpretable QSPR Modeling using Recursive Feature Machines and Multi-scale Fingerprints
Jiaxuan Shen, Haitao Zhang, Yunjie Wang, Yilong Wang, Song Tao, Bo, Qiu, Ng Shyh-Chang

TL;DR
This paper introduces Recursive Feature Machines with multi-scale fingerprints and AGOP-based deep feature learning to improve interpretability and accuracy in QSPR modeling, achieving state-of-the-art results in molecular property prediction.
Contribution
It presents a novel RFM framework integrating deep feature learning with AGOP, multi-scale fingerprints, and interpretability techniques for enhanced molecular property modeling.
Findings
RFM-HF outperforms traditional ML and GNN models.
Multi-scale fingerprints reveal structural determinants effectively.
AGOP enhances kernel machine interpretability and performance.
Abstract
This study pioneers the application of Recursive Feature Machines (RFM) in QSPR modeling, introducing a tailored feature importance analysis approach to enhance interpretability. By leveraging deep feature learning through AGOP, RFM achieves state-of-the-art (SOTA) results in predicting molecular properties, as demonstrated through solubility prediction across nine benchmark datasets. To capture a wide array of structural information, we employ diverse molecular representations, including MACCS keys, Morgan fingerprints, and a custom multi-scale hybrid fingerprint (HF) derived from global descriptors and SMILES local fragmentation techniques. Notably, the HF offers significant advantages over MACCS and Morgan fingerprints in revealing structural determinants of molecular properties. The feature importance analysis in RFM provides robust local and global explanations, effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Semantic Web and Ontologies
MethodsFragmentation
