"When Data is Scarce, Prompt Smarter"... Approaches to Grammatical Error Correction in Low-Resource Settings
Somsubhra De, Harsh Kumar, Arun Prakash A

TL;DR
This paper demonstrates that prompting large language models with few-shot strategies significantly improves grammatical error correction in low-resource Indic languages, outperforming traditional fine-tuned models.
Contribution
It introduces prompt-based approaches with large language models for low-resource GEC, showing substantial performance gains over fine-tuned models in Indic languages.
Findings
LLMs outperform fine-tuned models in low-resource GEC tasks
Carefully designed prompts enhance correction quality
Achieved top rankings in multiple Indic language GEC shared tasks
Abstract
Grammatical error correction (GEC) is an important task in Natural Language Processing that aims to automatically detect and correct grammatical mistakes in text. While recent advances in transformer-based models and large annotated datasets have greatly improved GEC performance for high-resource languages such as English, the progress has not extended equally. For most Indic languages, GEC remains a challenging task due to limited resources, linguistic diversity and complex morphology. In this work, we explore prompting-based approaches using state-of-the-art large language models (LLMs), such as GPT-4.1, Gemini-2.5 and LLaMA-4, combined with few-shot strategy to adapt them to low-resource settings. We observe that even basic prompting strategies, such as zero-shot and few-shot approaches, enable these LLMs to substantially outperform fine-tuned Indic-language models like Sarvam-22B,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
