Pointer over Attention: An Improved Bangla Text Summarization Approach Using Hybrid Pointer Generator Network
Nobel Dhar, Gaurob Saha, Prithwiraj Bhattacharjee, Avi Mallick, Md, Saiful Islam

TL;DR
This paper introduces a hybrid pointer generator network with coverage mechanism to improve Bangla text summarization, reducing repetition and enhancing factual accuracy, outperforming previous models on large datasets.
Contribution
The paper presents a novel hybrid pointer generator model with coverage for Bangla summarization, achieving state-of-the-art results and demonstrating stability on large-scale datasets.
Findings
Achieved ROUGE-1 score of 0.66 on BANSData
Achieved ROUGE-2 score of 0.42 on BANS-133k
Surpassed previous Bengali summarization techniques
Abstract
Despite the success of the neural sequence-to-sequence model for abstractive text summarization, it has a few shortcomings, such as repeating inaccurate factual details and tending to repeat themselves. We propose a hybrid pointer generator network to solve the shortcomings of reproducing factual details inadequately and phrase repetition. We augment the attention-based sequence-to-sequence using a hybrid pointer generator network that can generate Out-of-Vocabulary words and enhance accuracy in reproducing authentic details and a coverage mechanism that discourages repetition. It produces a reasonable-sized output text that preserves the conceptual integrity and factual information of the input article. For evaluation, we primarily employed "BANSData" - a highly adopted publicly available Bengali dataset. Additionally, we prepared a large-scale dataset called "BANS-133" which consists…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
