Coordinating Cross-modal Distillation for Molecular Property Prediction
Hao Zhang, Nan Zhang, Ruixin Zhang, Lei Shen, Yingyi Zhang, and Meng, Liu

TL;DR
This paper introduces a novel cross-modal distillation framework for molecular property prediction that effectively transfers 3D structural knowledge to 2D models, improving accuracy and efficiency in molecular graph learning.
Contribution
The paper proposes a coordinated distillation framework with global and local components, addressing challenges of view discrepancy and size variability, and provides theoretical insights for better atom and molecule information integration.
Findings
Achieved 6.9% performance improvement on PCQM4Mv2 dataset.
Attained fourth place with MAE of 0.0734 on OGB-LSC 2022 challenge.
Outperformed existing methods in molecular property prediction tasks.
Abstract
In recent years, molecular graph representation learning (GRL) has drawn much more attention in molecular property prediction (MPP) problems. The existing graph methods have demonstrated that 3D geometric information is significant for better performance in MPP. However, accurate 3D structures are often costly and time-consuming to obtain, limiting the large-scale application of GRL. It is an intuitive solution to train with 3D to 2D knowledge distillation and predict with only 2D inputs. But some challenging problems remain open for 3D to 2D distillation. One is that the 3D view is quite distinct from the 2D view, and the other is that the gradient magnitudes of atoms in distillation are discrepant and unstable due to the variable molecular size. To address these challenging problems, we exclusively propose a distillation framework that contains global molecular distillation and local…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Various Chemistry Research Topics
MethodsMasked autoencoder · Knowledge Distillation
