Adapting Language-Specific LLMs to a Reasoning Model in One Day via   Model Merging -- An Open Recipe

Kunat Pipatanakul; Pittawat Taveekitworachai; Potsawee Manakul; Kasima; Tharnpipitchai

arXiv:2502.09056·cs.CL·March 28, 2025

Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging -- An Open Recipe

Kunat Pipatanakul, Pittawat Taveekitworachai, Potsawee Manakul, Kasima, Tharnpipitchai

PDF

Open Access 1 Models 1 Datasets

TL;DR

This paper presents a method to quickly adapt language-specific LLMs, like Thai, to advanced reasoning models such as DeepSeek R1 within a day, using model merging and data selection, without losing language-specific performance.

Contribution

It introduces a simple, open recipe for merging models to enhance reasoning in low-resource language LLMs efficiently and cost-effectively.

Findings

01

Enhanced reasoning capabilities in Thai LLM to match DeepSeek R1

02

Achieved this with only publicly available data and $120 computational budget

03

Maintained language-specific performance after merging

Abstract

This paper investigates data selection and model merging methodologies aimed at incorporating advanced reasoning capabilities such as those of DeepSeek R1 into language-specific large language models (LLMs), with a particular focus on the Thai LLM. Our goal is to enhance the reasoning capabilities of language-specific LLMs while maintaining their target language abilities. DeepSeek R1 excels in reasoning but primarily benefits high-resource languages such as English and Chinese. However, low-resource languages remain underserved due to the dominance of English-centric training data and model optimizations, which limit performance in these languages. This limitation results in unreliable code-switching and diminished effectiveness on tasks in low-resource languages. Meanwhile, local and regional LLM initiatives have attempted to bridge this gap by developing language-specific LLMs that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
typhoon-ai/llama3.1-typhoon2-deepseek-r1-70b-preview
model· 16 dl· ♡ 13
16 dl♡ 13

Datasets

typhoon-ai/typhoon-r1-sft-data
dataset· 37 dl
37 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Topic Modeling

MethodsFocus