Physical probes expose and alleviate chemical-environment collapse in molecular representations
Jiebin Fang, Zidi Yan, Churu Mao, Yongjun Jiang, Xinyi Tang, Lei Miao, Dan Lu, Yun Huang, Wanjing Ding, Zhongjun Ma

TL;DR
This paper introduces CLAIM, a contrastive learning framework that aligns molecular topologies with NMR data to improve atom-level molecular representations and property predictions.
Contribution
It presents a novel spectral alignment approach that alleviates chemical-environment collapse in molecular representations, enhancing various molecular property tasks.
Findings
CLAIM improves atom-level molecule-spectrum retrieval accuracy.
It enhances stereoisomer discrimination without explicit 3D modeling.
CLAIM transfers effectively to broader molecular property prediction tasks.
Abstract
Nuclear magnetic resonance (NMR) spectroscopy provides an experimental readout of local chemical environments, but its use in molecular representation learning has been constrained by heterogeneous data and incomplete atom-level assignments. Here we construct complementary high-fidelity experimental and computational 13C NMR resources, which reveal a recurrent form of representational collapse: atoms that are equivalent in molecular topology can remain experimentally distinct in their real chemical environments, whereas explicit 3D descriptions are further limited by static conformations in dynamic regimes. To alleviate this bottleneck, we develop CLAIM (Contrastive Learning for Atom-to-molecule Inference of Molecular NMR), a framework that aligns efficient topological molecular inputs with atom-resolved NMR observables. Through hierarchical chemical priors and cross-level contrastive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
