AttriLens-Mol: Attribute Guided Reinforcement Learning for Molecular Property Prediction with Large Language Models
Xuan Lin, Long Chen, Yile Wang

TL;DR
AttriLens-Mol introduces an attribute-guided reinforcement learning framework that enhances molecular property prediction with large language models by focusing on relevant attributes, improving accuracy and interpretability.
Contribution
The paper presents a novel attribute-guided reinforcement learning approach that effectively elicits relevant molecular attributes during reasoning, outperforming existing methods in property prediction tasks.
Findings
Significantly improves prediction performance on multiple datasets.
Produces more relevant and interpretable molecular attributes.
Achieves comparable or better results than supervised fine-tuning and advanced models.
Abstract
Large Language Models (LLMs) have shown promise in assisting molecular property prediction tasks but often rely on human-crafted prompts and chain-of-thought templates. While recent advanced large reasoning models like DeepSeek-R1 employ reinforcement learning for an extended ``thinking'' process, their reasoning can be verbose and lack relevance. We introduce AttriLens-Mol, an attribute-guided reinforcement learning framework for molecular property prediction with LLMs. AttriLens-Mol steers the model's reasoning by using: (1) a format reward encouraging attribute-based structured output, (2) a count reward to avoid enumerating irrelevant attributes, and (3) a rationality reward using advanced LLMs and RDKit to verify the relatedness of the generated attributes. This approach implicitly elicits the model's inherent knowledge of relevant molecular attributes during reasoning, enables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Chemical Synthesis and Analysis
