Silo NLP's Participation at WAT2022

Shantipriya Parida; Subhadarshi Panda; Stig-Arne Gr\"onroos; Mark; Granroth-Wilding; Mika Koistinen

arXiv:2208.01296·cs.CL·August 3, 2022·1 cites

Silo NLP's Participation at WAT2022

Shantipriya Parida, Subhadarshi Panda, Stig-Arne Gr\"onroos, Mark, Granroth-Wilding, Mika Koistinen

PDF

Open Access 1 Repo

TL;DR

Silo NLP participated in WAT2022, developing Transformer-based models for multilingual and multimodal translation tasks, achieving top results in several language pairs and modalities.

Contribution

The paper introduces a system combining Transformer models and multimodal features for Asian language translation, with state-of-the-art performance.

Findings

01

Top performance in multiple translation tasks

02

Effective use of object tags as visual features

03

Successful fine-tuning of mBART-50 models

Abstract

This paper provides the system description of "Silo NLP's" submission to the Workshop on Asian Translation (WAT2022). We have participated in the Indic Multimodal tasks (English->Hindi, English->Malayalam, and English->Bengali Multimodal Translation). For text-only translation, we trained Transformers from scratch and fine-tuned mBART-50 models. For multimodal translation, we used the same mBART architecture and extracted object tags from the images to use as visual features concatenated with the text sequence. Our submission tops many tasks including English->Hindi multimodal translation (evaluation test), English->Malayalam text-only and multimodal translation (evaluation test), English->Bengali multimodal translation (challenge test), and English->Bengali text-only translation (evaluation test).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shantipriyap/silonlp_wat2022
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Natural Language Processing Techniques · Handwritten Text Recognition Techniques

MethodsmBART