Stemmer for Serbian language
Nikola Milo\v{s}evi\'c

TL;DR
This paper introduces a suffix stripping stemmer specifically designed for the highly inflectional Serbian language to improve linguistic processing and information retrieval tasks.
Contribution
It presents the first suffix stripping stemmer tailored for Serbian, addressing the challenges posed by its complex inflectional morphology.
Findings
Effective reduction of Serbian words to their stems
Improved accuracy in information retrieval tasks
Potential for adaptation to other inflectional languages
Abstract
In linguistic morphology and information retrieval, stemming is the process for reducing inflected (or sometimes derived) words to their stem, base or root form; generally a written word form. In this work is presented suffix stripping stemmer for Serbian language, one of the highly inflectional languages.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistics, Language Diversity, and Identity
