An Empirical Study of Some Selected IR Models for Bengali Monolingual Information Retrieval
Kamal Sarkar, Avisek Gupta

TL;DR
This paper evaluates the performance of TF-IDF and BM25 models on Bengali monolingual IR tasks using FIRE datasets from 2008 to 2012, providing insights into their effectiveness.
Contribution
It offers an empirical comparison of two IR models specifically for Bengali language retrieval, which is less explored in existing research.
Findings
BM25 outperforms TF-IDF in Bengali IR tasks.
Performance varies across different years of datasets.
Results contribute to better model selection for Bengali IR applications.
Abstract
This paper presents an evaluation and an analysis of some selected information retrieval models for Bengali monolingual information retrieval task. Two models, TF-IDF model and the Okapi BM25 model have been considered for our study. The developed IR models are tested on FIRE ad hoc retrieval data sets released for different years from 2008 to 2012 and the obtained results have been reported in this paper.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Text and Document Classification Technologies · Advanced Image and Video Retrieval Techniques
