QueEn: A Large Language Model for Quechua-English Translation

Junhao Chen; Peng Shu; Yiwei Li; Huaqin Zhao; Hanqi Jiang; Yi Pan,; Yifan Zhou; Zhengliang Liu; Lewis C Howe; Tianming Liu

arXiv:2412.05184·cs.CL·December 9, 2024

QueEn: A Large Language Model for Quechua-English Translation

Junhao Chen, Peng Shu, Yiwei Li, Huaqin Zhao, Hanqi Jiang, Yi Pan,, Yifan Zhou, Zhengliang Liu, Lewis C Howe, Tianming Liu

PDF

Open Access

TL;DR

This paper introduces QueEn, a novel translation system for Quechua-English that combines retrieval-augmented generation with efficient fine-tuning, significantly improving translation quality for a low-resource language.

Contribution

The paper presents a new method integrating RAG and LoRA for low-resource language translation, demonstrating substantial performance improvements.

Findings

01

BLEU score of 17.6 for Quechua-English translation

02

Outperforms baseline models with BLEU of 1.5

03

Addresses challenges of low-resource language translation

Abstract

Recent studies show that large language models (LLMs) are powerful tools for working with natural language, bringing advances in many areas of computational linguistics. However, these models face challenges when applied to low-resource languages due to limited training data and difficulty in understanding cultural nuances. In this paper, we propose QueEn, a novel approach for Quechua-English translation that combines Retrieval-Augmented Generation (RAG) with parameter-efficient fine-tuning techniques. Our method leverages external linguistic resources through RAG and uses Low-Rank Adaptation (LoRA) for efficient model adaptation. Experimental results show that our approach substantially exceeds baseline models, with a BLEU score of 17.6 compared to 1.5 for standard GPT models. The integration of RAG with fine-tuning allows our system to address the challenges of low-resource language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsAttention Is All You Need · Linear Warmup With Linear Decay · WordPiece · Layer Normalization · Residual Connection · Adam · Dense Connections · Cosine Annealing · Dropout · Attention Dropout