Enhancing Hindi NER in Low Context: A Comparative study of Transformer-based models with vs. without Retrieval Augmentation

Sumit Singh; Rohit Mishra; Uma Shanker Tiwary

arXiv:2507.16002·cs.CL·July 23, 2025

Enhancing Hindi NER in Low Context: A Comparative study of Transformer-based models with vs. without Retrieval Augmentation

Sumit Singh, Rohit Mishra, Uma Shanker Tiwary

PDF

Open Access

TL;DR

This study evaluates transformer-based models for Hindi NER, demonstrating that retrieval augmentation improves performance, especially in low-context scenarios, with fine-tuned models outperforming non-fine-tuned ones.

Contribution

It provides a comparative analysis of pretrained encoders and generative models with and without retrieval augmentation for Hindi NER, highlighting effective strategies for resource-limited languages.

Findings

01

Retrieval augmentation improves macro F1 scores for Hindi NER.

02

Fine-tuned models outperform non-fine-tuned generative models.

03

RA benefits are more significant in low-context data scenarios.

Abstract

One major challenge in natural language processing is named entity recognition (NER), which identifies and categorises named entities in textual input. In order to improve NER, this study investigates a Hindi NER technique that makes use of Hindi-specific pretrained encoders (MuRIL and XLM-R) and Generative Models ( Llama-2-7B-chat-hf (Llama2-7B), Llama-2-70B-chat-hf (Llama2-70B), Llama-3-70B-Instruct (Llama3-70B) and GPT3.5-turbo), and augments the data with retrieved data from external relevant contexts, notably from Wikipedia. We have fine-tuned MuRIL, XLM-R and Llama2-7B with and without RA. However, Llama2-70B, lama3-70B and GPT3.5-turbo are utilised for few-shot NER generation. Our investigation shows that the mentioned language models (LMs) with Retrieval Augmentation (RA) outperform baseline methods that don't incorporate RA in most cases. The macro F1 scores for MuRIL and XLM-R…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification · Speech Recognition and Synthesis · Neural Networks and Applications