Language Model Knowledge Distillation for Efficient Question Answering in Spanish
Adri\'an Bazaga, Pietro Li\`o, Gos Micklem

TL;DR
This paper introduces SpanishTinyRoBERTa, a compressed Spanish language model created through knowledge distillation, enabling efficient question answering with minimal performance loss, suitable for resource-limited environments.
Contribution
The paper presents a novel distilled Spanish language model based on RoBERTa, optimized for question answering and resource efficiency.
Findings
The distilled model maintains performance comparable to larger models.
Inference speed is significantly increased with minimal accuracy loss.
The approach facilitates NLP tasks in resource-constrained settings.
Abstract
Recent advances in the development of pre-trained Spanish language models has led to significant progress in many Natural Language Processing (NLP) tasks, such as question answering. However, the lack of efficient models imposes a barrier for the adoption of such models in resource-constrained environments. Therefore, smaller distilled models for the Spanish language could be proven to be highly scalable and facilitate their further adoption on a variety of tasks and scenarios. In this work, we take one step in this direction by developing SpanishTinyRoBERTa, a compressed language model based on RoBERTa for efficient question answering in Spanish. To achieve this, we employ knowledge distillation from a large model onto a lighter model that allows for a wider implementation, even in areas with limited computational resources, whilst attaining negligible performance sacrifice. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsAttention Is All You Need · Linear Layer · Residual Connection · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Softmax · Multi-Head Attention · WordPiece · Adam · Attention Dropout
