Low Resource Summarization using Pre-trained Language Models

Mubashir Munaf; Hammad Afzal; Naima Iltaf; Khawir Mahmood

arXiv:2310.02790·cs.CL·October 5, 2023·1 cites

Low Resource Summarization using Pre-trained Language Models

Mubashir Munaf, Hammad Afzal, Naima Iltaf, Khawir Mahmood

PDF

Open Access

TL;DR

This paper introduces a method for low-resource language summarization using adapted transformer models, demonstrating effective performance with a new Urdu dataset and a smaller model size.

Contribution

The paper presents a novel adaptation of transformer models for low-resource summarization and constructs a new Urdu dataset for benchmarking.

Findings

01

urT5 achieves up to 46.35 ROUGE-1 score

02

Model size reduced by 44.78% compared to mT5

03

Competitive results comparable to high-resource models

Abstract

With the advent of Deep Learning based Artificial Neural Networks models, Natural Language Processing (NLP) has witnessed significant improvements in textual data processing in terms of its efficiency and accuracy. However, the research is mostly restricted to high-resource languages such as English and low-resource languages still suffer from a lack of available resources in terms of training datasets as well as models with even baseline evaluation results. Considering the limited availability of resources for low-resource languages, we propose a methodology for adapting self-attentive transformer-based architecture models (mBERT, mT5) for low-resource summarization, supplemented by the construction of a new baseline dataset (76.5k article, summary pairs) in a low-resource language Urdu. Choosing news (a publicly available source) as the application domain has the potential to make the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods