SciCore-Mol: Augmenting Large Language Models with Pluggable Molecular Cognition Modules
Yuxuan Chen, Changwei Lv, Yunduo Xiao, Zhongjing Du, Daquan Zhou, Yukun Yan, Zheni Zeng, Zhiyuan Liu

TL;DR
SciCore-Mol enhances large language models with specialized modules for molecular perception, generation, and reasoning, significantly improving performance on chemical tasks and enabling scientific discovery.
Contribution
It introduces a modular framework with pluggable cognitive modules that bridge the gap between linguistic and molecular data in LLMs, advancing scientific AI capabilities.
Findings
Achieves strong performance across chemical tasks
Surpasses some proprietary models in several dimensions
Provides a systematic blueprint for scientific LLMs
Abstract
Large Language Models (LLMs) are central to the one-for-all intelligent paradigm, but they face a fundamental challenge when dealing with heterogeneous scientific data such as molecules: the inherent gap between discrete linguistic symbols and topological molecular or continuous reaction data leads to significant information loss and semantic noise in text-based reasoning. We propose SciCore-Mol, a modular framework that bridges this gap through three deeply integrated pluggable cognitive modules: a topology-aware perception module, a latent diffusion-based molecular generation module, and a reaction-aware reasoning module. Each module is coupled to the LLM backbone through learned representation interfaces, enabling richer information exchange than is possible with text-only tool feedback. Our experiments on diverse chemical tasks demonstrate that SciCore-Mol achieves strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
