Evaluation of some Information Retrieval models for Gujarati Ad hoc   Monolingual Tasks

Hardik J. Joshi; Pareek Jyoti

arXiv:1209.0126·cs.IR·January 22, 2020·5 cites

Evaluation of some Information Retrieval models for Gujarati Ad hoc Monolingual Tasks

Hardik J. Joshi, Pareek Jyoti

PDF

Open Access

TL;DR

This study evaluates various Information Retrieval models for Gujarati monolingual tasks, finding classical models like TF_IDF outperform recent probabilistic models, providing insights into effective IR approaches for Gujarati language processing.

Contribution

It presents a baseline for Gujarati IR and compares multiple models, highlighting the superior performance of classical IR models over recent probabilistic ones.

Findings

01

TF_IDF outperforms recent probabilistic models for Gujarati IR

02

Established a baseline with MAP values for Gujarati IR models

03

Identified the most effective IR models for Gujarati language

Abstract

This paper describes the work towards Gujarati Ad hoc Monolingual Retrieval task for widely used Information Retrieval (IR) models. We present an indexing baseline for the Gujarati Language represented by Mean Average Precision (MAP) values. Our objective is to obtain a relative picture of a better IR model for Gujarati Language. Results show that Classical IR models like Term Frequency Inverse Document Frequency (TF_IDF) performs better when compared to few recent probabilistic IR models. The experiments helped to identify the outperforming IR models for Gujarati Language.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInformation Retrieval and Search Behavior · Topic Modeling · Algorithms and Data Compression