Ladder: A Model-Agnostic Framework Boosting LLM-based Machine   Translation to the Next Level

Zhaopeng Feng; Ruizhe Chen; Yan Zhang; Zijie Meng; Zuozhu Liu

arXiv:2406.15741·cs.CL·October 30, 2024·1 cites

Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level

Zhaopeng Feng, Ruizhe Chen, Yan Zhang, Zijie Meng, Zuozhu Liu

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces MT-Ladder, a model-agnostic, cost-effective framework that enhances general-purpose LLMs for machine translation through hierarchical fine-tuning on pseudo-refinement data, achieving performance comparable to top-tier models.

Contribution

The paper presents MT-Ladder, a novel hierarchical fine-tuning method that improves LLM-based machine translation without additional human annotation or large-scale training.

Findings

01

MT-Ladder improves translation BLEU scores significantly.

02

MT-Ladder-2B matches top open-source models' performance.

03

MT-Ladder-7B approaches GPT-4-level translation quality.

Abstract

General-purpose Large Language Models (LLMs) like GPT-4 have achieved remarkable advancements in machine translation (MT) by leveraging extensive web content. On the other hand, translation-specific LLMs are built by pre-training on domain-specific monolingual corpora and fine-tuning with human-annotated translation data. Despite the superior performance, these methods either demand an unprecedented scale of computing and data or substantial human editing and annotation efforts. In this paper, we develop MT-Ladder, a novel model-agnostic and cost-effective tool to refine the performance of general LLMs for MT. MT-Ladder is trained on pseudo-refinement triplets which can be easily obtained from existing LLMs without additional human cost. During training, we propose a hierarchical fine-tuning strategy with an easy-to-hard schema, improving MT-Ladder's refining performance progressively.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level· underline

Taxonomy

TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing

MethodsAttention Is All You Need · Softmax · Layer Normalization · Absolute Position Encodings · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Linear Layer