Low Resource Summarization using Pre-trained Language Models
Mubashir Munaf, Hammad Afzal, Naima Iltaf, Khawir Mahmood

TL;DR
This paper introduces a method for low-resource language summarization using adapted transformer models, demonstrating effective performance with a new Urdu dataset and a smaller model size.
Contribution
The paper presents a novel adaptation of transformer models for low-resource summarization and constructs a new Urdu dataset for benchmarking.
Findings
urT5 achieves up to 46.35 ROUGE-1 score
Model size reduced by 44.78% compared to mT5
Competitive results comparable to high-resource models
Abstract
With the advent of Deep Learning based Artificial Neural Networks models, Natural Language Processing (NLP) has witnessed significant improvements in textual data processing in terms of its efficiency and accuracy. However, the research is mostly restricted to high-resource languages such as English and low-resource languages still suffer from a lack of available resources in terms of training datasets as well as models with even baseline evaluation results. Considering the limited availability of resources for low-resource languages, we propose a methodology for adapting self-attentive transformer-based architecture models (mBERT, mT5) for low-resource summarization, supplemented by the construction of a new baseline dataset (76.5k article, summary pairs) in a low-resource language Urdu. Choosing news (a publicly available source) as the application domain has the potential to make the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods
