Language Model Knowledge Distillation for Efficient Question Answering   in Spanish

Adri\'an Bazaga; Pietro Li\`o; Gos Micklem

arXiv:2312.04193·cs.CL·March 19, 2024·1 cites

Language Model Knowledge Distillation for Efficient Question Answering in Spanish

Adri\'an Bazaga, Pietro Li\`o, Gos Micklem

PDF

Open Access 1 Repo

TL;DR

This paper introduces SpanishTinyRoBERTa, a compressed Spanish language model created through knowledge distillation, enabling efficient question answering with minimal performance loss, suitable for resource-limited environments.

Contribution

The paper presents a novel distilled Spanish language model based on RoBERTa, optimized for question answering and resource efficiency.

Findings

01

The distilled model maintains performance comparable to larger models.

02

Inference speed is significantly increased with minimal accuracy loss.

03

The approach facilitates NLP tasks in resource-constrained settings.

Abstract

Recent advances in the development of pre-trained Spanish language models has led to significant progress in many Natural Language Processing (NLP) tasks, such as question answering. However, the lack of efficient models imposes a barrier for the adoption of such models in resource-constrained environments. Therefore, smaller distilled models for the Spanish language could be proven to be highly scalable and facilitate their further adoption on a variety of tasks and scenarios. In this work, we take one step in this direction by developing SpanishTinyRoBERTa, a compressed language model based on RoBERTa for efficient question answering in Spanish. To achieve this, we employ knowledge distillation from a large model onto a lighter model that allows for a wider implementation, even in areas with limited computational resources, whilst attaining negligible performance sacrifice. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adrianbzg/tinyroberta-distillation-qa-es
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsAttention Is All You Need · Linear Layer · Residual Connection · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Softmax · Multi-Head Attention · WordPiece · Adam · Attention Dropout