BeliN: A Novel Corpus for Bengali Religious News Headline Generation using Contextual Feature Fusion
Md Osama, Ashim Dey, Kawsar Ahmed, Muhammad Ashad Kabir

TL;DR
This paper introduces BeliN, a new Bengali religious news corpus, and MultiGen, a multi-input model that fuses contextual features with news content to improve headline generation, demonstrating significant performance gains.
Contribution
The study presents a novel Bengali religious news dataset and a multi-input transformer-based model that incorporates contextual features for improved headline generation.
Findings
MultiGen outperforms baseline models in BLEU and ROUGE scores.
Incorporating contextual features enhances headline generation quality.
The dataset and code are publicly available for further research.
Abstract
Automatic text summarization, particularly headline generation, remains a critical yet underexplored area for Bengali religious news. Existing approaches to headline generation typically rely solely on the article content, overlooking crucial contextual features such as sentiment, category, and aspect. This limitation significantly hinders their effectiveness and overall performance. This study addresses this limitation by introducing a novel corpus, BeliN (Bengali Religious News) - comprising religious news articles from prominent Bangladeshi online newspapers, and MultiGen - a contextual multi-input feature fusion headline generation approach. Leveraging transformer-based pre-trained language models such as BanglaT5, mBART, mT5, and mT0, MultiGen integrates additional contextual features - including category, aspect, and sentiment - with the news content. This fusion enables the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · SentencePiece · Linear Layer · Softmax · Dense Connections · Dropout · Gated Linear Unit · Inverse Square Root Schedule
