A conformational benchmark for optical property prediction with solvent-aware graph neural networks
Denis Potapov, Sergei Rogovoi, Kuzma Khrabrov, Konstantin Ushenin, Alexey Korovin, Anton Ber, Artur Kadurin, Artem Tsypin

TL;DR
This paper introduces a new dataset and benchmark for predicting molecular optical properties using 3D-aware neural networks, improving accuracy by over 30%.
Contribution
The novel contribution is a solvent-aware modification to 3D GNNs and a curated benchmark dataset with conformational data for optical property prediction.
Findings
The proposed 3D GNN model achieves a 30% improvement in MAE over previous methods.
Geometry optimization fidelity significantly impacts prediction accuracy.
The dataset includes 26,369 chromophore-solvent pairs with multiple conformations.
Abstract
Accurately predicting optical spectra of molecules is essential for creating better OLED emitters, solar-cell dyes, and fluorescent probes. Traditional methods, such as time-dependent density-functional theory, are computationally expensive and often inaccurate. Current Graph Neural Network (GNN) approaches for optical properties prediction are faster and offer better performance. Still, they operate on 2D graphs and ignore the 3D geometrical features that control excited-state behavior. We present nablaColors-3D, a rigorously curated dataset for the prediction of optical properties consisting of 26369 chromophore-solvent pairs with three conformations optimized at different levels of quantum theory. Based on this dataset, we establish a scaffold-split benchmark for 3D GNNs and systematically quantify how the fidelity of geometry optimization affects accuracy. Furthermore, we propose a…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Computational Drug Discovery Methods
