Steering Large Language Models for Machine Translation with Finetuning   and In-Context Learning

Duarte M. Alves; Nuno M. Guerreiro; Jo\~ao Alves; Jos\'e Pombal,; Ricardo Rei; Jos\'e G. C. de Souza; Pierre Colombo; Andr\'e F. T. Martins

arXiv:2310.13448·cs.CL·October 23, 2023·2 cites

Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning

Duarte M. Alves, Nuno M. Guerreiro, Jo\~ao Alves, Jos\'e Pombal,, Ricardo Rei, Jos\'e G. C. de Souza, Pierre Colombo, Andr\'e F. T. Martins

PDF

Open Access

TL;DR

This paper explores finetuning large language models for machine translation, demonstrating that adapter-based methods can outperform few-shot prompting, and proposes a hybrid approach to retain few-shot capabilities while benefiting from finetuning.

Contribution

It introduces an adapter-based finetuning method that matches traditional finetuning performance with fewer parameters and proposes a hybrid approach to preserve few-shot learning abilities.

Findings

01

Adapter-based finetuning matches traditional finetuning performance.

02

Finetuning can degrade few-shot capabilities.

03

Hybrid approach recovers few-shot performance while benefiting from finetuning.

Abstract

Large language models (LLMs) are a promising avenue for machine translation (MT). However, current LLM-based MT systems are brittle: their effectiveness highly depends on the choice of few-shot examples and they often require extra post-processing due to overgeneration. Alternatives such as finetuning on translation instructions are computationally expensive and may weaken in-context learning capabilities, due to overspecialization. In this paper, we provide a closer look at this problem. We start by showing that adapter-based finetuning with LoRA matches the performance of traditional finetuning while reducing the number of training parameters by a factor of 50. This method also outperforms few-shot prompting and eliminates the need for post-processing or in-context examples. However, we show that finetuning generally degrades few-shot performance, hindering adaptation capabilities.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications