Improving Large Molecular Language Model via Relation-aware Multimodal Collaboration

Jinyoung Park; Minseong Bae; Jeehye Na; Hyunwoo J. Kim

arXiv:2601.12256·cs.AI·January 21, 2026

Improving Large Molecular Language Model via Relation-aware Multimodal Collaboration

Jinyoung Park, Minseong Bae, Jeehye Na, Hyunwoo J. Kim

PDF

Open Access 1 Video

TL;DR

This paper introduces CoLLaMo, a relation-aware multimodal large language model for molecules that improves integration of diverse molecular data, reducing hallucinations and enhancing performance across various molecular understanding tasks.

Contribution

The paper proposes a novel relation-aware multimodal attention mechanism and an automatic evaluation framework, advancing molecular language models' robustness and multimodal integration capabilities.

Findings

01

Achieves state-of-the-art results on multiple molecular tasks

02

Reduces hallucination and improves robustness in molecular understanding

03

Enhances generalization across molecular modalities

Abstract

Large language models (LLMs) have demonstrated their instruction-following capabilities and achieved powerful performance on various tasks. Inspired by their success, recent works in the molecular domain have led to the development of large molecular language models (LMLMs) that integrate 1D molecular strings or 2D molecular graphs into the language models. However, existing LMLMs often suffer from hallucination and limited robustness, largely due to inadequate integration of diverse molecular modalities such as 1D sequences, 2D molecular graphs, and 3D conformations. To address these limitations, we propose CoLLaMo, a large language model-based molecular assistant equipped with a multi-level molecular modality-collaborative projector. The relation-aware modality-collaborative attention mechanism in the projector facilitates fine-grained and relation-guided information exchange between…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Improving Large Molecular Language Model via Relation-aware Multimodal Collaboration· underline

Taxonomy

TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Topic Modeling