Boosting LLM Translation Skills without General Ability Loss via Rationale Distillation

Junhong Wu; Yang Zhao; Yangyifan Xu; Bing Liu; Chengqing Zong

arXiv:2410.13944·cs.CL·October 21, 2024

TL;DR

This paper introduces RaDis, a novel rationale distillation method that improves LLM translation skills without sacrificing their general abilities, by using generated rationales to prevent forgetting during fine-tuning.

Contribution

RaDis leverages self-generated rationales to enhance translation performance while preserving LLMs' broad capabilities, addressing limitations of traditional fine-tuning methods.

Findings

01. Improved translation accuracy demonstrated in experiments.

02. Maintained general abilities across multiple NLP tasks.

03. Reduced catastrophic forgetting during fine-tuning.

Abstract

Large Language Models (LLMs) have achieved impressive results across numerous NLP tasks but still encounter difficulties in machine translation. Traditional methods to improve translation have typically involved fine-tuning LLMs on parallel corpora. However, vanilla fine-tuning often leads to catastrophic forgetting of instruction-following capabilities and of alignment with human preferences, compromising the models' broad general abilities and introducing potential security risks. Because these abilities are developed using proprietary, unavailable training data, existing continual instruction tuning methods are ineffective. To overcome this issue, we propose a novel approach called RaDis (Rationale Distillation). RaDis harnesses the strong generative capabilities of LLMs to create rationales for training data, which are then "replayed" to prevent forgetting. These rationales…
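The data-construction step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract only says that rationales are generated for the training data and then "replayed", so appending the rationale to the reference translation is an assumption, as are the prompt template and every name here (`ParallelExample`, `TrainingExample`, `build_prompt`, `add_rationales`, `stub_rationale`). In practice `generate_rationale` would call the LLM being fine-tuned; a stub stands in for it so the sketch runs on its own.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ParallelExample:
    source: str      # source-language sentence
    reference: str   # human reference translation


@dataclass
class TrainingExample:
    prompt: str      # instruction shown to the model
    target: str      # text the model is fine-tuned to produce


def build_prompt(source: str, src_lang: str, tgt_lang: str) -> str:
    # Hypothetical instruction template; the source text does not specify one.
    return f"Translate the following text from {src_lang} to {tgt_lang}:\n{source}"


def add_rationales(
    examples: List[ParallelExample],
    generate_rationale: Callable[[str, str], str],
    src_lang: str,
    tgt_lang: str,
) -> List[TrainingExample]:
    """Attach a model-generated rationale to each reference translation.

    Assumption: the rationale is appended after the reference, so that
    fine-tuning on the combined target also rehearses the model's own
    general-purpose output.
    """
    augmented = []
    for ex in examples:
        prompt = build_prompt(ex.source, src_lang, tgt_lang)
        rationale = generate_rationale(prompt, ex.reference).strip()
        # Fall back to the plain reference if no rationale was produced.
        target = f"{ex.reference}\n\n{rationale}" if rationale else ex.reference
        augmented.append(TrainingExample(prompt=prompt, target=target))
    return augmented


def stub_rationale(prompt: str, reference: str) -> str:
    # Stand-in for the LLM; a real setup would sample text from the model.
    return f'Explanation: "{reference}" preserves the meaning of the source sentence.'


if __name__ == "__main__":
    data = [ParallelExample(source="Guten Morgen", reference="Good morning")]
    for item in add_rationales(data, stub_rationale, "German", "English"):
        print(item.prompt)
        print(item.target)
```

Passing the generator in as a callable keeps the data construction independent of any particular model or inference library.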

Taxonomy

Topics: Natural Language Processing Techniques · Text Readability and Simplification