Atomas: Hierarchical Alignment on Molecule-Text for Unified Molecule Understanding and Generation
Yikun Zhang, Geyan Ye, Chaohao Yuan, Bo Han, Long-Kai Huang, Jianhua, Yao, Wei Liu, Yu Rong

TL;DR
Atomas introduces a hierarchical alignment framework for molecule-text representation learning, capturing fine-grained details and improving performance across diverse molecular understanding and generation tasks.
Contribution
It proposes a novel hierarchical alignment model that learns fine-grained fragment correspondence between molecules and text, enhancing molecular representation capabilities.
Findings
Outperforms 11 baselines on 12 tasks across 11 datasets
Demonstrates robustness and scalability in scaling experiments
Qualitative analysis confirms chemical relevance of learned representations
Abstract
Molecule-and-text cross-modal representation learning has emerged as a promising direction for enhancing the quality of molecular representation, thereby improving performance in various scientific fields. However, most approaches employ a global alignment approach to learn the knowledge from different modalities that may fail to capture fine-grained information, such as molecule-and-text fragments and stereoisomeric nuances, which is crucial for downstream tasks. Furthermore, it is incapable of modeling such information using a similar global alignment strategy due to the lack of annotations about the fine-grained fragments in the existing dataset. In this paper, we propose Atomas, a hierarchical molecular representation learning framework that jointly learns representations from SMILES strings and text. We design a Hierarchical Adaptive Alignment model to automatically learn the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · History and advancements in chemistry
MethodsALIGN
