Hyacinth6B: A large language model for Traditional Chinese

Chih-Wei Song; Yin-Te Tsai

arXiv:2403.13334 · cs.CL · March 27, 2024 · 1 citation

TL;DR

Hyacinth6B is a lightweight large language model for Traditional Chinese that balances performance and resource efficiency through parameter-efficient fine-tuning, aiming to maximize capabilities with lower hardware demands.

Contribution

The paper introduces Hyacinth6B, a resource-efficient LLM for Traditional Chinese, utilizing LoRA fine-tuning to achieve high performance with reduced hardware requirements.

Findings

01 Hyacinth6B achieves competitive performance on Chinese NLP tasks.

02 Parameter-efficient fine-tuning reduces training costs.

03 The model demonstrates an effective balance between size and accuracy.

Abstract

The primary motivation of this study is to address the high hardware and computational demands typically associated with LLMs. Our goal is therefore to find a balance between model lightness and performance, striving to maximize performance while using a comparatively lightweight model. Hyacinth6B was developed with this objective in mind, aiming to fully leverage the core capabilities of LLMs without incurring substantial resource costs, effectively pushing the boundaries of smaller models' performance. The training approach involves parameter-efficient fine-tuning using the LoRA method.
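
To make the LoRA idea mentioned above concrete, here is a minimal sketch in plain Python, not the authors' training code. It assumes the standard LoRA formulation, in which a frozen weight matrix W is augmented with a trainable low-rank product B·A scaled by alpha / rank. The function names, the 4096×4096 layer size, and the rank of 8 are illustrative assumptions; none of them come from the paper.

```python
def matvec(M, v):
    """Multiply matrix M (a list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def lora_forward(W, A, B, x, alpha, rank):
    """Compute y = W x + (alpha / rank) * B (A x).

    W (d_out x d_in) stays frozen; only A (rank x d_in) and
    B (d_out x rank) would be updated during fine-tuning.
    """
    base = matvec(W, x)
    update = matvec(B, matvec(A, x))
    scale = alpha / rank
    return [b + scale * u for b, u in zip(base, update)]


def lora_param_counts(d_out, d_in, rank):
    """Return (full, lora) trainable parameter counts for one weight matrix."""
    full = d_out * d_in
    lora = rank * (d_in + d_out)
    return full, lora


# Hypothetical layer size and rank, chosen only for illustration.
full, lora = lora_param_counts(4096, 4096, 8)
print(f"full: {full}, LoRA: {lora}, ratio: {lora / full:.4%}")
```

With these assumed dimensions, the adapter trains well under one percent of the parameters that full fine-tuning of the same matrix would touch. Savings of this kind are what parameter-efficient fine-tuning relies on to lower hardware demands.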

Code & Models

Models

Hugging Face: chillymiao/Hyacinth6B (model, 2 downloads, 1 like)

Taxonomy

Topics: Natural Language Processing Techniques