Life Cycle-Aware Evaluation of Knowledge Distillation for Machine Translation: Environmental Impact and Translation Quality Trade-offs
Joseph Attieh, Timothee Mickus, Anne-Laure Ligozat, Aur\'elie N\'ev\'eol, J\"org Tiedemann

TL;DR
This paper evaluates knowledge distillation in machine translation by analyzing both translation quality and environmental impact, providing a comprehensive protocol for selecting methods based on compute and quality trade-offs.
Contribution
It introduces a life cycle-aware assessment framework that considers environmental impact alongside translation quality for KD methods in machine translation.
Findings
Distillation overhead is dominant at small deployment volumes.
Inference costs dominate at large scales, affecting KD benefits.
Word-level distillation offers better footprint-quality trade-offs than sequence-level.
Abstract
Knowledge distillation (KD) is a tool to compress a larger system (teacher) into a smaller one (student). In machine translation, studies typically report only the translation quality of the student and omit the computational complexity of performing KD, making it difficult to select among the many available KD choices under compute-induced constraints. In this study, we evaluate representative KD methods by considering both translation quality and computational cost. We express computational cost as a carbon footprint using the machine learning life cycle assessment (MLCA) tool. This assessment accounts for runtime operational emissions and amortized hardware production costs throughout the KD model life cycle (teacher training, distillation, and inference). We find that (i) distillation overhead dominates the total footprint at small deployment volumes, (ii) inference dominates at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGreen IT and Sustainability · Big Data and Digital Economy · Machine Learning in Materials Science
