Can Large Language Models Understand Molecules?
Shaghayegh Sadeghi, Alan Bui, Ali Forooghi, Jianguo Lu, Alioune Ngom

TL;DR
This study evaluates the ability of large language models, specifically GPT and LLaMA, to understand and generate molecular embeddings from SMILES strings, demonstrating LLaMA's superior performance in molecular property and drug-drug interaction prediction tasks.
Contribution
It compares GPT and LLaMA models for molecular embedding from SMILES, showing LLaMA's effectiveness and potential in cheminformatics applications.
Findings
LLaMA outperforms GPT in molecular property prediction
LLaMA's SMILES embeddings are comparable to pre-trained models
LLaMA surpasses pre-trained models in drug-drug interaction prediction
Abstract
Purpose: Large Language Models (LLMs) like GPT (Generative Pre-trained Transformer) from OpenAI and LLaMA (Large Language Model Meta AI) from Meta AI are increasingly recognized for their potential in the field of cheminformatics, particularly in understanding Simplified Molecular Input Line Entry System (SMILES), a standard method for representing chemical structures. These LLMs also have the ability to decode SMILES strings into vector representations. Method: We investigate the performance of GPT and LLaMA compared to pre-trained models on SMILES in embedding SMILES strings on downstream tasks, focusing on two key applications: molecular property prediction and drug-drug interaction prediction. Results: We find that SMILES embeddings generated using LLaMA outperform those from GPT in both molecular property and DDI prediction tasks. Notably, LLaMA-based SMILES embeddings show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Biosensing Techniques and Applications
MethodsAttention Is All You Need · Dense Connections · Cosine Annealing · Linear Layer · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Linear Warmup With Cosine Annealing · Residual Connection · Byte Pair Encoding · Adam
