A Novel Approach for Automatic Bengali Question Answering System using Semantic Similarity Analysis
Arijit Das, Jaydeep Mandal, Zargham Danial, Alok Ranjan Pal, Diganta, Saha

TL;DR
This paper presents a semantic similarity-based approach for Bengali question answering, ranking answers effectively without relying on shared keywords, achieving high accuracy on a large dataset.
Contribution
It introduces a novel semantic similarity algorithm for Bengali QA systems combining statistical parameters, entropy, and sense scoring, with extensive testing on a large dataset.
Findings
82.3% of answers ranked first
90.0% of answers within top 5
97.32% accuracy achieved
Abstract
Finding the semantically accurate answer is one of the key challenges in advanced searching. In contrast to keyword-based searching, the meaning of a question or query is important here and answers are ranked according to relevance. It is very natural that there is almost no common word between the question sentence and the answer sentence. In this paper, an approach is described to find out the semantically relevant answers in the Bengali dataset. In the first part of the algorithm, a set of statistical parameters like frequency, index, part-of-speech (POS), etc. is matched between a question and the probable answers. In the second phase, entropy and similarity are calculated in different modules. Finally, a sense score is generated to rank the answers. The algorithm is tested on a repository containing a total of 275000 sentences. This Bengali repository is a product of Technology…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
