MolEdit: Knowledge Editing for Multimodal Molecule Language Models

Zhenyu Lei; Patrick Soga; Yaochen Zhu; Yinhan He; Yushun Dong; Jundong Li

arXiv:2511.12770·cs.LG·December 1, 2025

MolEdit: Knowledge Editing for Multimodal Molecule Language Models

Zhenyu Lei, Patrick Soga, Yaochen Zhu, Yinhan He, Yushun Dong, Jundong Li

PDF

Open Access

TL;DR

MolEdit introduces a novel framework for precise knowledge editing in multimodal molecule language models, improving accuracy and robustness while preserving unrelated molecular knowledge.

Contribution

This paper pioneers knowledge editing for MoLMs, proposing MolEdit with specialized adapters and an expertise-aware switcher to target specific molecular knowledge updates.

Findings

01

Up to 18.8% improvement in editing reliability

02

12.0% better preservation of unrelated knowledge

03

Effective across multiple MoLM architectures

Abstract

Understanding and continuously refining multimodal molecular knowledge is crucial for advancing biomedicine, chemistry, and materials science. Molecule language models (MoLMs) have become powerful tools in these domains, integrating structural representations (e.g., SMILES strings, molecular graphs) with rich contextual descriptions (e.g., physicochemical properties). However, MoLMs can encode and propagate inaccuracies due to outdated web-mined training corpora or malicious manipulation, jeopardizing downstream discovery pipelines. While knowledge editing has been explored for general-domain AI, its application to MoLMs remains uncharted, presenting unique challenges due to the multifaceted and interdependent nature of molecular knowledge. In this paper, we take the first step toward MoLM editing for two critical tasks: molecule-to-caption generation and caption-to-molecule generation.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBiomedical Text Mining and Ontologies · Machine Learning in Materials Science · Topic Modeling