BanglaSarc: A Dataset for Sarcasm Detection
Tasnim Sakib Apon, Ramisa Anan, Elizabeth Antora Modhu, Arjun Suter,, Ifrit Jamal Sneha, MD. Golam Rabiul Alam

TL;DR
BanglaSarc is a newly created, publicly available dataset of over 5000 Bengali social media comments designed to advance sarcasm detection and related emotional analysis in the Bengali language.
Contribution
This paper introduces BanglaSarc, the first large-scale dataset specifically for sarcasm detection in Bengali social media content.
Findings
Dataset contains 5112 comments from social media platforms
Facilitates research in sarcasm detection and emotion recognition in Bengali
Supports further development of NLP tools for Bengali language
Abstract
Being one of the most widely spoken language in the world, the use of Bangla has been increasing in the world of social media as well. Sarcasm is a positive statement or remark with an underlying negative motivation that is extensively employed in today's social media platforms. There has been a significant improvement in sarcasm detection in English over the previous many years, however the situation regarding Bangla sarcasm detection remains unchanged. As a result, it is still difficult to identify sarcasm in bangla, and a lack of high-quality data is a major contributing factor. This article proposes BanglaSarc, a dataset constructed specifically for bangla textual data sarcasm detection. This dataset contains of 5112 comments/status and contents collected from various online social platforms such as Facebook, YouTube, along with a few online blogs. Due to the limited amount of data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Educational Methods and Technology · Text and Document Classification Technologies
