Typhoon T1: An Open Thai Reasoning Model

Pittawat Taveekitworachai; Potsawee Manakul; Kasima Tharnpipitchai,; Kunat Pipatanakul

arXiv:2502.09042·cs.CL·March 28, 2025

Typhoon T1: An Open Thai Reasoning Model

Pittawat Taveekitworachai, Potsawee Manakul, Kasima Tharnpipitchai,, Kunat Pipatanakul

PDF

Open Access 1 Models 2 Datasets

TL;DR

This paper presents Typhoon T1, an open Thai reasoning model built with supervised fine-tuning on open datasets, capable of generating reasoning traces in a low-resource language, aiming to advance research in this area.

Contribution

It introduces a cost-effective approach to developing reasoning models in low-resource languages using supervised fine-tuning and shares detailed datasets, model weights, and insights.

Findings

01

Successfully generated reasoning traces in Thai.

02

Demonstrated generalization across multiple domains.

03

Provided open datasets and model weights for research use.

Abstract

This paper introduces Typhoon T1, an open effort to develop an open Thai reasoning model. A reasoning model is a relatively new type of generative model built on top of large language models (LLMs). A reasoning model generates a long chain of thought before arriving at a final answer, an approach found to improve performance on complex tasks. However, details on developing such a model are limited, especially for reasoning models that can generate traces in a low-resource language. Typhoon T1 presents an open effort that dives into the details of developing a reasoning model in a more cost-effective way by leveraging supervised fine-tuning using open datasets, instead of reinforcement learning. This paper shares the details about synthetic data generation and training, as well as our dataset and model weights. Additionally, we provide insights gained from developing a reasoning model…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
typhoon-ai/llama3.2-typhoon2-t1-3b-research-preview
model· 54 dl· ♡ 6
54 dl♡ 6

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, Reasoning, and Knowledge