OpenThaiGPT 1.5: A Thai-Centric Open Source Large Language Model
Sumeth Yuenyong, Kobkrit Viriyayudhakorn, Apivadee Piyatumrong, Jillaphat Jaroenkantasima

TL;DR
OpenThaiGPT 1.5 is a state-of-the-art Thai language chat model based on Qwen v2.5, finetuned on extensive instruction data, supporting multi-turn conversations, RAG, and tool-calling, with strong benchmark performance.
Contribution
This paper introduces OpenThaiGPT 1.5, an open-source Thai chat model finetuned from Qwen v2.5 that adds multi-turn conversation, RAG, and tool-calling support and outperforms other open-source Thai language models on Thai benchmarks.
Findings
Outperforms other open-source Thai models on benchmarks
Supports multi-turn conversations and tool integration
Demonstrates practical deployment strategies
Abstract
OpenThaiGPT 1.5 is an advanced Thai language chat model based on Qwen v2.5, finetuned on over 2,000,000 Thai instruction pairs. This report provides an engineering perspective on the model's development, capabilities, and performance. We discuss the model's architecture, training process, and key features, including multi-turn conversation support, Retrieval Augmented Generation (RAG) compatibility, and tool-calling functionality. Benchmark results demonstrate OpenThaiGPT 1.5's state-of-the-art performance on various Thai language tasks, outperforming other open-source Thai language models. We also address practical considerations such as GPU memory requirements and deployment strategies.
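The abstract mentions GPU memory requirements without giving figures here. As a rough illustration only, the sketch below estimates the memory needed to hold the model weights alone at a few common precisions. The parameter counts are nominal values read off the released model names (7b, 14b, 72b), and the helper name `weight_memory_gib` is hypothetical; none of these numbers come from the report. Real deployments also need memory for the KV cache, activations, and framework overhead, so these are lower bounds.

```python
# Rough lower-bound estimate of GPU memory needed for model weights.
# ASSUMPTION: parameter counts are nominal values taken from the model
# names (7b, 14b, 72b); the exact counts differ slightly.
NOMINAL_PARAMS = {"7b": 7e9, "14b": 14e9, "72b": 72e9}

# Bytes needed to store one weight at common precisions.
BYTES_PER_PARAM = {"fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(size: str, precision: str) -> float:
    """Return the memory (in GiB) occupied by the weights alone.

    Excludes KV cache, activations, and runtime overhead.
    """
    return NOMINAL_PARAMS[size] * BYTES_PER_PARAM[precision] / 2**30


# Print one line per model size with the estimate at each precision.
for size in NOMINAL_PARAMS:
    row = ", ".join(
        f"{precision}: {weight_memory_gib(size, precision):.1f} GiB"
        for precision in BYTES_PER_PARAM
    )
    print(f"{size} -> {row}")
```

Weight size scales linearly with bytes per parameter, so halving the precision halves this lower bound; whether a given quantization preserves the model's benchmark quality is a separate question this sketch does not address.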
Code & Models
- 🤗 openthaigpt/openthaigpt-1.0.0-7b-chat (model) · 352 dl · ♡ 17
- 🤗 openthaigpt/openthaigpt-1.0.0-13b-chat (model) · 141 dl · ♡ 7
- 🤗 openthaigpt/openthaigpt-1.0.0-70b-chat (model) · 151 dl · ♡ 12
- 🤗 openthaigpt/openthaigpt-1.0.0-7b-chat-gguf (model) · 61 dl · ♡ 7
- 🤗 openthaigpt/openthaigpt1.5-7b-instruct (model) · 3.1k dl · ♡ 16
- 🤗 openthaigpt/openthaigpt1.5-72b-instruct (model) · 41 dl · ♡ 10
- 🤗 openthaigpt/openthaigpt1.5-14b-instruct (model) · 459 dl · ♡ 6
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis
