The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama   with Vision-Aware and Function-Calling Capabilities

MediaTek Research: Chan-Jan Hsu; Chia-Sheng Liu; Meng-Hsi Chen; Muxi; Chen; Po-Chun Hsu; Yi-Chang Chen; Da-Shan Shiu

arXiv:2501.13921·cs.CL·February 12, 2025

The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities

MediaTek Research: Chan-Jan Hsu, Chia-Sheng Liu, Meng-Hsi Chen, Muxi, Chen, Po-Chun Hsu, Yi-Chang Chen, Da-Shan Shiu

PDF

Open Access 1 Repo 9 Models

TL;DR

Breeze2 is a suite of multi-modal Chinese language models based on Llama, enhanced with vision and function-calling capabilities, achieving state-of-the-art performance in its size class and released for public use.

Contribution

The paper introduces Breeze2, a new Chinese LLM with vision and function-calling features, built on Llama 3.2, and demonstrates its superior performance and open-source release.

Findings

01

Breeze2 outperforms existing models in Chinese function calling.

02

Breeze2 achieves strong results in vision understanding tasks.

03

Models are publicly available and demonstrated on mobile platforms.

Abstract

Llama-Breeze2 (hereinafter referred to as Breeze2) is a suite of advanced multi-modal language models, available in 3B and 8B parameter configurations, specifically designed to enhance Traditional Chinese language representation. Building upon the Llama 3.2 model family, we continue the pre-training of Breeze2 on an extensive corpus to enhance the linguistic and cultural heritage of Traditional Chinese. In addition to language modeling capabilities, we significantly augment the models with function calling and vision understanding capabilities. At the time of this publication, as far as we are aware, absent reasoning-inducing prompts, Breeze2 are the strongest performing models in Traditional Chinese function calling and image understanding in its size class. The effectiveness of Breeze2 is benchmarked across various tasks, including Taiwan general knowledge, instruction-following, long…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mtkresearch/mr-models
noneOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSubtitles and Audiovisual Media

MethodsLLaMA