UniCorn: A Unified Contrastive Learning Approach for Multi-view Molecular Representation Learning
Shikun Feng, Yuyan Ni, Minghao Li, Yanwen Huang, Zhi-Ming Ma, Wei-Ying, Ma, Yanyan Lan

TL;DR
UniCorn is a unified contrastive learning framework for molecular representation that combines multiple views to improve performance across diverse molecular tasks, addressing limitations of existing specialized pre-training methods.
Contribution
The paper introduces UniCorn, a novel unified pre-training framework that integrates multiple molecular views, enhancing generality and effectiveness across various molecular tasks.
Findings
Achieves state-of-the-art results on quantum, physicochemical, and biological tasks.
Provides comprehensive ablation studies validating universality.
Unifies existing contrastive learning methods for molecular pre-training.
Abstract
Recently, a noticeable trend has emerged in developing pre-trained foundation models in the domains of CV and NLP. However, for molecular pre-training, there lacks a universal model capable of effectively applying to various categories of molecular tasks, since existing prevalent pre-training methods exhibit effectiveness for specific types of downstream tasks. Furthermore, the lack of profound understanding of existing pre-training methods, including 2D graph masking, 2D-3D contrastive learning, and 3D denoising, hampers the advancement of molecular foundation models. In this work, we provide a unified comprehension of existing pre-training methods through the lens of contrastive learning. Thus their distinctions lie in clustering different views of molecules, which is shown beneficial to specific downstream tasks. To achieve a complete and general-purpose molecular representation, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Bioinformatics · Machine Learning in Materials Science · Domain Adaptation and Few-Shot Learning
