$\text{M}^{2}$LLM: Multi-view Molecular Representation Learning with Large Language Models
Jiaxin Ju, Yizhen Zheng, Huan Yee Koh, Can Wang, Shirui Pan

TL;DR
This paper introduces $ ext{M}^{2}$LLM, a multi-view framework leveraging large language models to generate rich molecular representations by integrating structural, task-specific, and rule-based perspectives, leading to state-of-the-art results.
Contribution
The paper proposes a novel multi-view approach that combines LLM reasoning with molecular data, enhancing molecular property prediction beyond traditional methods.
Findings
$ ext{M}^{2}$LLM achieves state-of-the-art performance on multiple benchmarks.
LLM-derived representations outperform traditional molecular features.
Multi-view fusion adapts dynamically to different tasks.
Abstract
Accurate molecular property prediction is a critical challenge with wide-ranging applications in chemistry, materials science, and drug discovery. Molecular representation methods, including fingerprints and graph neural networks (GNNs), achieve state-of-the-art results by effectively deriving features from molecular structures. However, these methods often overlook decades of accumulated semantic and contextual knowledge. Recent advancements in large language models (LLMs) demonstrate remarkable reasoning abilities and prior knowledge across scientific domains, leading us to hypothesize that LLMs can generate rich molecular representations when guided to reason in multiple perspectives. To address these gaps, we propose LLM, a multi-view framework that integrates three perspectives: the molecular structure view, the molecular task view, and the molecular rules view. These…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Advanced Graph Neural Networks
