Global Human-guided Counterfactual Explanations for Molecular Properties via Reinforcement Learning
Danqing Wang, Antonis Antoniades, Kha-Dinh Luong, Edwin Zhang, Mert, Kosan, Jiachen Li, Ambuj Singh, William Yang Wang, Lei Li

TL;DR
This paper introduces RLHEX, a reinforcement learning framework that generates human-aligned global counterfactual explanations for molecular property prediction, improving interpretability and coverage over existing methods.
Contribution
RLHEX is a novel model combining a VAE-based graph generator and human-defined principles, optimized with PPO, to produce interpretable global explanations for molecular GNNs.
Findings
Increases explanation coverage by 4.12%
Reduces explanation set distance by 0.47%
Aligns explanations with domain expertise
Abstract
Counterfactual explanations of Graph Neural Networks (GNNs) offer a powerful way to understand data that can naturally be represented by a graph structure. Furthermore, in many domains, it is highly desirable to derive data-driven global explanations or rules that can better explain the high-level properties of the models and data in question. However, evaluating global counterfactual explanations is hard in real-world datasets due to a lack of human-annotated ground truth, which limits their use in areas like molecular sciences. Additionally, the increasing scale of these datasets provides a challenge for random search-based methods. In this paper, we develop a novel global explanation model RLHEX for molecular property prediction. It aligns the counterfactual explanations with human-defined principles, making the explanations more interpretable and easy for experts to evaluate. RLHEX…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science
MethodsSparse Evolutionary Training · Adapter
