Scalable Machine Translation in Memory Constrained Environments
Paul Baltescu

TL;DR
This paper presents methods to adapt statistical machine translation systems for mobile devices with limited memory, enabling scalable translation without relying on constant internet access.
Contribution
The paper introduces alternative components for SMT systems that significantly reduce memory usage, making translation feasible on resource-constrained mobile devices.
Findings
Memory-efficient translation components developed
Achieved scalable translation on mobile devices
Reduced dependency on internet connectivity
Abstract
Machine translation is the discipline concerned with developing automated tools for translating from one human language to another. Statistical machine translation (SMT) is the dominant paradigm in this field. In SMT, translations are generated by means of statistical models whose parameters are learned from bilingual data. Scalability is a key concern in SMT, as one would like to make use of as much data as possible to train better translation systems. In recent years, mobile devices with adequate computing power have become widely available. Despite being very successful, mobile applications relying on NLP systems continue to follow a client-server architecture, which is of limited use because access to internet is often limited and expensive. The goal of this dissertation is to show how to construct a scalable machine translation system that can operate with the limited resources…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Algorithms and Data Compression
