End-to-End Speech Translation with Pre-trained Models and Adapters: UPC   at IWSLT 2021

Gerard I. G\'allego; Ioannis Tsiamas; Carlos Escolano; Jos\'e A. R.; Fonollosa; Marta R. Costa-juss\`a

arXiv:2105.04512·cs.CL·June 29, 2021

End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021

Gerard I. G\'allego, Ioannis Tsiamas, Carlos Escolano, Jos\'e A. R., Fonollosa, Marta R. Costa-juss\`a

PDF

1 Repo

TL;DR

This paper presents an end-to-end speech translation system combining pre-trained models and adapters, achieving competitive BLEU scores on TED talk translation tasks with efficient fine-tuning and custom segmentation techniques.

Contribution

The paper introduces a novel end-to-end speech translation approach using pre-trained models with adapters and efficient fine-tuning, improving convergence and translation quality.

Findings

01

Achieved a BLEU score of 27.3 with the base model.

02

Ensemble model improved BLEU to 28.22.

03

Custom segmentation increased BLEU by 2.5-3 points.

Abstract

This paper describes the submission to the IWSLT 2021 offline speech translation task by the UPC Machine Translation group. The task consists of building a system capable of translating English audio recordings extracted from TED talks into German text. Submitted systems can be either cascade or end-to-end and use a custom or given segmentation. Our submission is an end-to-end speech translation system, which combines pre-trained models (Wav2Vec 2.0 and mBART) with coupling modules between the encoder and decoder, and uses an efficient fine-tuning technique, which trains only 20% of its total parameters. We show that adding an Adapter to the system and pre-training it, can increase the convergence speed and the final result, with which we achieve a BLEU score of 27.3 on the MuST-C test set. Our final model is an ensemble that obtains 28.22 BLEU score on the same set. Our submission also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mt-upc/iwslt-2021
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.