Chem-GMNet: A Sphere-Native Geometric Transformer for Molecular Property Prediction
Deepak Warrier, Raja Sekhar Pappala

TL;DR
Chem-GMNet introduces a domain-native geometric transformer leveraging sphere-based modules for molecular property prediction, outperforming or matching existing models with fewer parameters and no pretraining.
Contribution
It presents a novel sphere-native transformer architecture tailored for chemistry, demonstrating improved performance over traditional text-based models on MoleculeNet benchmarks.
Findings
Chem-GMNet outperforms ChemBERTa-2 on 7 of 10 MoleculeNet endpoints.
Pretrained Chem-GMNet matches or exceeds ChemBERTa-2 on 6 of 8 shared endpoints.
Increasing sphere dimension improves performance, achieving state-of-the-art results without pretraining.
Abstract
Modern SMILES-based chemical language models obtain strong MoleculeNet performance by treating SMILES as generic text and compensating with multi-million-molecule self-supervised pretraining. We ask: when a domain carries structural priors as rich as chemistry's, does it warrant a domain-native transformer rather than a generic one rescued by scale? We answer affirmatively with \textbf{GM-Net} (Geometric Measure Network), a transformer family in which every module is replaced by a sphere-native counterpart, and instantiate it as \textbf{Chem-GMNet}. Three blocks follow: SH-Embedding (tokens as learnable directions on lifted through a Gegenbauer feature map); DualSKA (a per-head fusion of a linear-time gated Sphere-Flow recurrence whose persistent state we prove is the truncated multipole expansion of the input distribution, and a softmax Sphere-Kernel branch over the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
