MolEdit: Knowledge Editing for Multimodal Molecule Language Models
Zhenyu Lei, Patrick Soga, Yaochen Zhu, Yinhan He, Yushun Dong, Jundong Li

TL;DR
MolEdit introduces a novel framework for precise knowledge editing in multimodal molecule language models, improving accuracy and robustness while preserving unrelated molecular knowledge.
Contribution
This paper pioneers knowledge editing for MoLMs, proposing MolEdit with specialized adapters and an expertise-aware switcher to target specific molecular knowledge updates.
Findings
Up to 18.8% improvement in editing reliability
12.0% better preservation of unrelated knowledge
Effective across multiple MoLM architectures
Abstract
Understanding and continuously refining multimodal molecular knowledge is crucial for advancing biomedicine, chemistry, and materials science. Molecule language models (MoLMs) have become powerful tools in these domains, integrating structural representations (e.g., SMILES strings, molecular graphs) with rich contextual descriptions (e.g., physicochemical properties). However, MoLMs can encode and propagate inaccuracies due to outdated web-mined training corpora or malicious manipulation, jeopardizing downstream discovery pipelines. While knowledge editing has been explored for general-domain AI, its application to MoLMs remains uncharted, presenting unique challenges due to the multifaceted and interdependent nature of molecular knowledge. In this paper, we take the first step toward MoLM editing for two critical tasks: molecule-to-caption generation and caption-to-molecule generation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Machine Learning in Materials Science · Topic Modeling
