LLaMo: Large Language Model-based Molecular Graph Assistant

Jinyoung Park; Minseong Bae; Dohwan Ko; Hyunwoo J. Kim

arXiv:2411.00871·cs.LG·November 5, 2024

LLaMo: Large Language Model-based Molecular Graph Assistant

Jinyoung Park, Minseong Bae, Dohwan Ko, Hyunwoo J. Kim

PDF

Open Access 1 Repo 1 Video

TL;DR

LLaMo is a novel large language model-based molecular graph assistant that integrates graph and language modalities for diverse molecular understanding tasks, leveraging instruction tuning and a multi-level graph projector.

Contribution

It introduces a multi-level graph projector and molecular graph instruction data for end-to-end training of a molecular graph-language model, enhancing molecular understanding capabilities.

Findings

01

LLaMo outperforms existing models on molecular description and property prediction tasks.

02

The multi-level graph projector effectively bridges graph and language modalities.

03

Instruction tuning improves generalization across molecular tasks.

Abstract

Large Language Models (LLMs) have demonstrated remarkable generalization and instruction-following capabilities with instruction tuning. The advancements in LLMs and instruction tuning have led to the development of Large Vision-Language Models (LVLMs). However, the competency of the LLMs and instruction tuning have been less explored in the molecular domain. Thus, we propose LLaMo: Large Language Model-based Molecular graph assistant, which is an end-to-end trained large molecular graph-language model. To bridge the discrepancy between the language and graph modalities, we present the multi-level graph projector that transforms graph representations into graph tokens by abstracting the output representations of each GNN layer and motif representations with the cross-attention mechanism. We also introduce machine-generated molecular graph instruction data to instruction-tune the large…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mlvlab/llamo
pytorchOfficial

Videos

LLaMo: Large Language Model-based Molecular Graph Assistant· slideslive

Taxonomy

TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Topic Modeling