SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules

Yuxuan Chen; Changwei Lv; Yunduo Xiao; Zhongjing Du; Daquan Zhou; Yukun Yan; Zheni Zeng; Zhiyuan Liu

arXiv:2605.22287·cs.AI·May 22, 2026

SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules

Yuxuan Chen, Changwei Lv, Yunduo Xiao, Zhongjing Du, Daquan Zhou, Yukun Yan, Zheni Zeng, Zhiyuan Liu

PDF

TL;DR

SciCore-Mol enhances large language models with specialized modules for molecular perception, generation, and reasoning, significantly improving performance on chemical tasks and enabling scientific discovery.

Contribution

It introduces a modular framework with pluggable cognitive modules that bridge the gap between linguistic and molecular data in LLMs, advancing scientific AI capabilities.

Findings

01

Achieves strong performance across chemical tasks

02

Surpasses some proprietary models in several dimensions

03

Provides a systematic blueprint for scientific LLMs

Abstract

Large Language Models (LLMs) are central to the one-for-all intelligent paradigm, but they face a fundamental challenge when dealing with heterogeneous scientific data such as molecules: the inherent gap between discrete linguistic symbols and topological molecular or continuous reaction data leads to significant information loss and semantic noise in text-based reasoning. We propose SciCore-Mol, a modular framework that bridges this gap through three deeply integrated pluggable cognitive modules: a topology-aware perception module, a latent diffusion-based molecular generation module, and a reaction-aware reasoning module. Each module is coupled to the LLM backbone through learned representation interfaces, enabling richer information exchange than is possible with text-only tool feedback. Our experiments on diverse chemical tasks demonstrate that SciCore-Mol achieves strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.