MolTailor: Tailoring Chemical Molecular Representation to Specific Tasks via Text Prompts
Haoqiang Guo, Sendong Zhao, Haochun Wang, Yanrui Du, Bing Qin

TL;DR
MolTailor leverages language models to customize molecular representations based on task-specific natural language descriptions, improving predictive accuracy in drug discovery applications.
Contribution
This paper introduces MolTailor, a novel method that uses language models as an agent to tailor molecular representations to specific tasks, enhancing relevance and performance.
Findings
MolTailor outperforms baseline methods in molecular property prediction.
Task-specific natural language prompts improve molecular representation relevance.
Language model-guided optimization enhances molecular learning efficiency.
Abstract
Deep learning is now widely used in drug discovery, providing significant acceleration and cost reduction. As the most fundamental building block, molecular representation is essential for predicting molecular properties to enable various downstream applications. Most existing methods attempt to incorporate more information to learn better representations. However, not all features are equally important for a specific task. Ignoring this would potentially compromise the training efficiency and predictive accuracy. To address this issue, we propose a novel approach, which treats language models as an agent and molecular pretraining models as a knowledge base. The agent accentuates task-relevant features in the molecular representation by understanding the natural language description of the task, just as a tailor customizes clothes for clients. Thus, we call this approach MolTailor.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Chemical Synthesis and Analysis
