An Accuracy-Enhanced Stemming Algorithm for Arabic Information Retrieval
Sadik Bessou, Mohamed Touahria

TL;DR
This paper introduces ESAIR, an Arabic stemming algorithm that improves information retrieval accuracy by accurately extracting roots, demonstrated through experiments showing up to 96% accuracy.
Contribution
The paper presents a novel Arabic stemming algorithm that enhances retrieval accuracy by effectively extracting roots using a template-based approach.
Findings
Root extraction accuracy up to 96%
Significant improvement in retrieval relevance
Effective in reducing silence during retrieval
Abstract
This paper provides a method for indexing and retrieving Arabic texts, based on natural language processing. Our approach exploits the notion of template in word stemming and replaces the words by their stems. This technique has proven to be effective since it has returned significant relevant retrieval results by decreasing silence during the retrieval phase. Series of experiments have been conducted to test the performance of the proposed algorithm ESAIR (Enhanced Stemmer for Arabic Information Retrieval). The results obtained indicate that the algorithm extracts the exact root with an accuracy rate up to 96% and hence, improving information retrieval.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
