An Empirical Study of Some Selected IR Models for Bengali Monolingual   Information Retrieval

Kamal Sarkar; Avisek Gupta

arXiv:1706.03266·cs.IR·June 13, 2017·1 cites

An Empirical Study of Some Selected IR Models for Bengali Monolingual Information Retrieval

Kamal Sarkar, Avisek Gupta

PDF

Open Access

TL;DR

This paper evaluates the performance of TF-IDF and BM25 models on Bengali monolingual IR tasks using FIRE datasets from 2008 to 2012, providing insights into their effectiveness.

Contribution

It offers an empirical comparison of two IR models specifically for Bengali language retrieval, which is less explored in existing research.

Findings

01

BM25 outperforms TF-IDF in Bengali IR tasks.

02

Performance varies across different years of datasets.

03

Results contribute to better model selection for Bengali IR applications.

Abstract

This paper presents an evaluation and an analysis of some selected information retrieval models for Bengali monolingual information retrieval task. Two models, TF-IDF model and the Okapi BM25 model have been considered for our study. The developed IR models are tested on FIRE ad hoc retrieval data sets released for different years from 2008 to 2012 and the obtained results have been reported in this paper.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Text and Document Classification Technologies · Advanced Image and Video Retrieval Techniques