Bengali Abstractive News Summarization(BANS): A Neural Attention   Approach

Prithwiraj Bhattacharjee; Avi Mallick; Md Saiful Islam,; Marium-E-Jannat

arXiv:2012.01747·cs.CL·December 18, 2020

Bengali Abstractive News Summarization(BANS): A Neural Attention Approach

Prithwiraj Bhattacharjee, Avi Mallick, Md Saiful Islam,, Marium-E-Jannat

PDF

1 Repo

TL;DR

This paper introduces a Bengali abstractive news summarization system using a seq2seq LSTM model with attention, trained on a large dataset, achieving significant improvements over previous methods.

Contribution

The paper presents the first large-scale Bengali news summarization dataset and a novel attention-based seq2seq LSTM model that outperforms existing approaches.

Findings

01

Significant improvement in human evaluation scores.

02

Largest publicly available Bengali news summarization dataset.

03

Effective attention mechanism producing human-like summaries.

Abstract

Abstractive summarization is the process of generating novel sentences based on the information extracted from the original text document while retaining the context. Due to abstractive summarization's underlying complexities, most of the past research work has been done on the extractive summarization approach. Nevertheless, with the triumph of the sequence-to-sequence (seq2seq) model, abstractive summarization becomes more viable. Although a significant number of notable research has been done in the English language based on abstractive summarization, only a couple of works have been done on Bengali abstractive news summarization (BANS). In this article, we presented a seq2seq based Long Short-Term Memory (LSTM) network model with attention at encoder-decoder. Our proposed system deploys a local attention-based model that produces a long sequence of words with lucid and human-like…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Prithwiraj12/Bengali-Deep-News-Summarization
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory · Sequence to Sequence