SynAsk: Unleashing the Power of Large Language Models in Organic Synthesis
Chonghuan Zhang, Qianghua Lin, Biwei Zhu, Haopeng Yang, Xiao Lian, Hao, Deng, Jiajun Zheng, Kuangbiao Liao

TL;DR
SynAsk is a specialized large language model platform for organic chemistry that combines fine-tuning and external resources to assist researchers with synthesis, literature, and molecular data in a question-answer format.
Contribution
It introduces a domain-specific LLM for organic chemistry that integrates knowledge bases and chemistry tools using a novel fine-tuning and resource integration approach.
Findings
Enhanced prediction of chemical reactions and retrosynthesis.
Seamless access to chemical literature and molecular data.
Improved research efficiency in organic synthesis.
Abstract
The field of natural language processing (NLP) has witnessed a transformative shift with the emergence of large language models (LLMs), revolutionizing various language tasks and applications, and the integration of LLM into specialized domains enhances their capabilities for domain-specific applications. Notably, NLP has made significant strides in organic chemistry, particularly in predicting synthetic tasks, paving the way for the development of LLMs tailored to the organic chemistry field. In this work, we introduce SynAsk, a comprehensive organic chemistry domain-specific LLM platform developed by AIChemEco Inc. By finetuning an LLM with domain-specific data and integrating it with a chain of thought approach, SynAsk seamlessly accesses our knowledge base and advanced chemistry tools in a question-and-answer format. This includes functionalities such as a basic chemistry knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
