The Low-Resource Double Bind: An Empirical Study of Pruning for Low-Resource Machine Translation
Orevaoghene Ahia, Julia Kreutzer, Sara Hooker

TL;DR
This study investigates how pruning neural networks affects low-resource machine translation, revealing that sparsity maintains performance on common sentences, enhances robustness to out-of-distribution data, and helps mitigate memorization of infrequent attributes.
Contribution
It provides the first empirical analysis of pruning effects in low-resource translation, highlighting its benefits for robustness and generalization in data-limited settings.
Findings
Sparsity preserves performance on frequent sentences.
Pruning improves robustness to out-of-distribution shifts.
Sparsity reduces memorization of low-frequency attributes.
Abstract
A "bigger is better" explosion in the number of parameters in deep neural networks has made it increasingly challenging to make state-of-the-art networks accessible in compute-restricted environments. Compression techniques have taken on renewed importance as a way to bridge the gap. However, evaluation of the trade-offs incurred by popular compression techniques has been centered on high-resource datasets. In this work, we instead consider the impact of compression in a data-limited regime. We introduce the term low-resource double bind to refer to the co-occurrence of data limitations and compute resource constraints. This is a common setting for NLP for low-resource languages, yet the trade-offs in performance are poorly studied. Our work offers surprising insights into the relationship between capacity and generalization in data-limited regimes for the task of machine translation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsPruning
