Exploring Hierarchical Molecular Graph Representation in Multimodal LLMs
Chengxin Hu, Hao Li, Yihe Yuan, Jing Li, Ivor Tsang

TL;DR
This paper investigates the impact of multi-level graph features on multimodal LLMs for biochemical tasks, revealing that hierarchical graph representations enhance model understanding and performance.
Contribution
It introduces an analysis of feature granularity effects and highlights the need for dynamic hierarchical graph processing in multimodal LLMs for molecular data.
Findings
Reducing GNN features to a single token does not harm performance.
Different graph feature levels influence molecule quality and task accuracy.
Current models lack comprehensive understanding of hierarchical graph features.
Abstract
Following the milestones in large language models (LLMs) and multimodal models, we have seen a surge in applying LLMs to biochemical tasks. Leveraging graph features and molecular text representations, LLMs can tackle various tasks, such as predicting chemical reaction outcomes and describing molecular properties. However, most current work overlooks the *multi-level nature* of the graph modality, even though different chemistry tasks may benefit from different feature levels. In this work, we first study the effect of feature granularity and reveal that even reducing all GNN-generated feature tokens to a single one does not significantly impact model performance. We then investigate the effect of various graph feature levels and demonstrate that both the quality of LLM-generated molecules and model performance across different tasks depend on different graph feature levels. Therefore,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Biomedical Text Mining and Ontologies · Advanced Text Analysis Techniques
