ChemLLM: A Chemical Large Language Model
Di Zhang, Wei Liu, Qian Tan, Jingdan Chen, Hang Yan, Yuliang Yan,, Jiatong Li, Weiran Huang, Xiangyu Yue, Wanli Ouyang, Dongzhan Zhou, Shufei, Zhang, Mao Su, Han-Sen Zhong, Yuqiang Li

TL;DR
ChemLLM introduces a dedicated large language model for chemistry, integrating structured chemical knowledge, a specialized dataset, and a comprehensive benchmark, achieving high performance on core chemical tasks and enabling advanced dialogue interactions.
Contribution
It is the first chemistry-specific LLM that combines structured knowledge, instruction tuning data, and a dedicated benchmark to improve chemical task performance.
Findings
ChemLLM performs comparably to GPT-4 on core chemical tasks.
It demonstrates competitive performance with similarly sized LLMs in general scenarios.
The framework sets a new standard for LLMs in scientific domains.
Abstract
Large language models (LLMs) have made impressive progress in chemistry applications. However, the community lacks an LLM specifically designed for chemistry. The main challenges are two-fold: firstly, most chemical data and scientific knowledge are stored in structured databases, which limits the model's ability to sustain coherent dialogue when used directly. Secondly, there is an absence of objective and fair benchmark that encompass most chemistry tasks. Here, we introduce ChemLLM, a comprehensive framework that features the first LLM dedicated to chemistry. It also includes ChemData, a dataset specifically designed for instruction tuning, and ChemBench, a robust benchmark covering nine essential chemistry tasks. ChemLLM is adept at performing various tasks across chemical disciplines with fluid dialogue interaction. Notably, ChemLLM achieves results comparable to GPT-4 on the core…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗AI4Chem/ChemLLM-7B-Chatmodel· 414 dl· ♡ 83414 dl♡ 83
- 🤗AI4Chem/ChemLLM-7B-Chat-1_5-DPOmodel· 11k dl· ♡ 2711k dl♡ 27
- 🤗AI4Chem/ChemLLM-20B-Chat-SFTmodel· 10 dl· ♡ 510 dl♡ 5
- 🤗AI4Chem/ChemLLM-20B-Chat-DPOmodel· 173 dl· ♡ 10173 dl♡ 10
- 🤗AI4Chem/CHEMLLM-2b-1_5model· 62 dl· ♡ 162 dl♡ 1
- 🤗AI4Chem/ChemLLM-7B-Chat-1_5-SFTmodel· 43 dl· ♡ 543 dl♡ 5
- 🤗RichardErkhov/AI4Chem_-_CHEMLLM-2b-1_5-awqmodel· 1 dl1 dl
- 🤗aciidix/ChemLLM-7B-Chat-1_5-DPOmodel· 2 dl2 dl
- 🤗WhynotCicci/ChemLLM-7B-Chatmodel· 11 dl11 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Position-Wise Feed-Forward Layer · Label Smoothing · Cosine Annealing · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Dropout · Attention Is All You Need
