Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders
Xiang Kong, Adithya Renduchintala, James Cross, Yuqing Tang, Jiatao, Gu, Xian Li

TL;DR
This paper explores a multilingual translation model with a deep encoder and multiple shallow decoders, achieving faster inference with minimal quality loss for many-to-one translation, but facing challenges in one-to-many scenarios.
Contribution
It introduces the DEMSD model that uses a deep encoder with multiple shallow decoders tailored for different language subsets, improving speed without quality loss in many-to-one translation.
Findings
Achieves 1.8x speedup with no quality drop in many-to-one translation.
Shallow decoders cause quality drop in one-to-many translation, but DEMSD mitigates this.
Deep encoder with multiple shallow decoders is effective for efficiency in multilingual translation.
Abstract
Recent work in multilingual translation advances translation quality surpassing bilingual baselines using deep transformer models with increased capacity. However, the extra latency and memory costs introduced by this approach may make it unacceptable for efficiency-constrained applications. It has recently been shown for bilingual translation that using a deep encoder and shallow decoder (DESD) can reduce inference latency while maintaining translation quality, so we study similar speed-accuracy trade-offs for multilingual translation. We find that for many-to-one translation we can indeed increase decoder speed without sacrificing quality using this approach, but for one-to-many translation, shallow decoders cause a clear quality drop. To ameliorate this drop, we propose a deep encoder with multiple shallow decoders (DEMSD) where each shallow decoder is responsible for a disjoint…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Machine Learning in Bioinformatics · Topic Modeling
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
