Could Chemical LLMs benefit from Message Passing

Jiaqing Xie; Ziheng Chi

arXiv:2405.08334·cs.LG·June 17, 2025

Could Chemical LLMs benefit from Message Passing

Jiaqing Xie, Ziheng Chi

PDF

Open Access

TL;DR

This paper explores whether integrating message passing neural networks with chemical language models can improve molecular property prediction, especially for smaller molecular graphs, by proposing and testing two information fusion strategies.

Contribution

It introduces two novel strategies for integrating MPNNs with language models and evaluates their effectiveness on molecular graph datasets.

Findings

01

Integration improves performance on small molecular graphs.

02

No significant gains on large molecular graphs.

03

Contrast learning and fusion strategies are effective for smaller graphs.

Abstract

Pretrained language models (LMs) showcase significant capabilities in processing molecular text, while concurrently, message passing neural networks (MPNNs) demonstrate resilience and versatility in the domain of molecular science. Despite these advancements, we find there are limited studies investigating the bidirectional interactions between molecular structures and their corresponding textual representations. Therefore, in this paper, we propose two strategies to evaluate whether an information integration can enhance the performance: contrast learning, which involves utilizing an MPNN to supervise the training of the LM, and fusion, which exploits information from both models. Our empirical analysis reveals that the integration approaches exhibit superior performance compared to baselines when applied to smaller molecular graphs, while these integration approaches do not yield…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMolecular Junctions and Nanostructures · Distributed systems and fault tolerance

MethodsMessage Passing Neural Network