NepaliGPT: A Generative Language Model for the Nepali Language

Shushanta Pudasaini; Aman Shakya; Siddhartha Shrestha; Sahil Bhatta; Sunil Thapa; Sushmita Palikhe

arXiv:2506.16399·cs.CL·June 23, 2025

NepaliGPT: A Generative Language Model for the Nepali Language

Shushanta Pudasaini, Aman Shakya, Siddhartha Shrestha, Sahil Bhatta, Sunil Thapa, Sushmita Palikhe

PDF

Open Access

TL;DR

This paper introduces NepaliGPT, the first large language model for Nepali, built on a new corpus and benchmark dataset, achieving promising results in text generation and coherence.

Contribution

NepaliGPT is the first Nepali-specific generative language model, utilizing a new corpus and benchmark dataset to advance Nepali NLP research.

Findings

01

Perplexity of 26.32 in text generation

02

ROUGE-1 score of 0.2604

03

Causal coherence of 81.25%

Abstract

After the release of ChatGPT, Large Language Models (LLMs) have gained huge popularity in recent days and thousands of variants of LLMs have been released. However, there is no generative language model for the Nepali language, due to which other downstream tasks, including fine-tuning, have not been explored yet. To fill this research gap in the Nepali NLP space, this research proposes \textit{NepaliGPT}, a generative large language model tailored specifically for the Nepali language. This research introduces an advanced corpus for the Nepali language collected from several sources, called the Devanagari Corpus. Likewise, the research introduces the first NepaliGPT benchmark dataset comprised of 4,296 question-answer pairs in the Nepali language. The proposed LLM NepaliGPT achieves the following metrics in text generation: Perplexity of 26.32245, ROUGE-1 score of 0.2604, causal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText Readability and Simplification · Artificial Intelligence in Healthcare and Education · Topic Modeling