ChemLLM: A Chemical Large Language Model

Di Zhang; Wei Liu; Qian Tan; Jingdan Chen; Hang Yan; Yuliang Yan; Jiatong Li; Weiran Huang; Xiangyu Yue; Wanli Ouyang; Dongzhan Zhou; Shufei Zhang; Mao Su; Han-Sen Zhong; Yuqiang Li

arXiv:2402.06852·cs.AI·April 26, 2024·41 cites

Open Access 1 Repo 9 Models 5 Datasets

TL;DR

ChemLLM introduces a dedicated large language model for chemistry, integrating structured chemical knowledge, a specialized dataset, and a comprehensive benchmark, achieving high performance on core chemical tasks and enabling advanced dialogue interactions.

Contribution

It is the first chemistry-specific LLM that combines structured knowledge, instruction tuning data, and a dedicated benchmark to improve chemical task performance.

Findings

01. ChemLLM performs comparably to GPT-4 on core chemical tasks.

02. It demonstrates competitive performance with similarly sized LLMs in general scenarios.

03. The framework sets a new standard for LLMs in scientific domains.

Abstract

Large language models (LLMs) have made impressive progress in chemistry applications. However, the community lacks an LLM specifically designed for chemistry. The main challenges are twofold. First, most chemical data and scientific knowledge are stored in structured databases, which limits the model's ability to sustain coherent dialogue when used directly. Second, there is no objective and fair benchmark that encompasses most chemistry tasks. Here, we introduce ChemLLM, a comprehensive framework that features the first LLM dedicated to chemistry. It also includes ChemData, a dataset specifically designed for instruction tuning, and ChemBench, a robust benchmark covering nine essential chemistry tasks. ChemLLM is adept at performing various tasks across chemical disciplines with fluid dialogue interaction. Notably, ChemLLM achieves results comparable to GPT-4 on the core…

Code & Models

Repositories

keyhsw/chemllm
pytorch

Taxonomy

Topics: Computational Drug Discovery Methods

Methods: Position-Wise Feed-Forward Layer · Label Smoothing · Cosine Annealing · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Attention Dropout · Attention Is All You Need