EPIK: Eliminating multi-model Pipelines with Knowledge-distillation

Bhavesh Laddagiri; Yash Raj; Anshuman Dash

arXiv:2211.14920·cs.CL·November 29, 2022

EPIK: Eliminating multi-model Pipelines with Knowledge-distillation

Bhavesh Laddagiri, Yash Raj, Anshuman Dash

PDF

Open Access

TL;DR

EPIK introduces a novel knowledge distillation method that converts multi-model, multi-stage transliteration pipelines into a single end-to-end model, maintaining performance while reducing complexity and execution time.

Contribution

The paper presents EPIK, a new distillation technique that creates end-to-end models from multi-stage pipelines without needing dedicated datasets, improving efficiency and performance.

Findings

01

EPIK achieves CER of 0.015 and phonetic accuracy of 92.1%.

02

Model reduces execution time by 54.3%.

03

EPIK sometimes outperforms the original pipeline.

Abstract

Real-world tasks are largely composed of multiple models, each performing a sub-task in a larger chain of tasks, i.e., using the output from a model as input for another model in a multi-model pipeline. A model like MATRa performs the task of Crosslingual Transliteration in two stages, using English as an intermediate transliteration target when transliterating between two indic languages. We propose a novel distillation technique, EPIK, that condenses two-stage pipelines for hierarchical tasks into a single end-to-end model without compromising performance. This method can create end-to-end models for tasks without needing a dedicated end-to-end dataset, solving the data scarcity problem. The EPIK model has been distilled from the MATra model using this technique of knowledge distillation. The MATra model can perform crosslingual transliteration between 5 languages - English, Hindi,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Multimodal Machine Learning Applications · Topic Modeling