GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical   Error Correction with Supervised Fine-Tuning

Yaxin Fan; Feng Jiang; Peifeng Li; and Haizhou Li

arXiv:2307.13923·cs.CL·August 21, 2023·5 cites

GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning

Yaxin Fan, Feng Jiang, Peifeng Li, and Haizhou Li

PDF

Open Access 1 Repo

TL;DR

GrammarGPT is an open-source Chinese grammatical error correction model that leverages hybrid datasets and instruction tuning, outperforming state-of-the-art systems despite using significantly less training data.

Contribution

This work introduces GrammarGPT, a novel approach that fine-tunes open-source LLMs for Chinese grammatical error correction using hybrid datasets and error-invariant augmentation.

Findings

01

GrammarGPT outperforms existing SOTA systems in Chinese GEC.

02

It requires 1200x less data than comparable models.

03

Achieved 3rd place in NLPCC2023 SharedTask1.

Abstract

Grammatical error correction aims to correct ungrammatical sentences automatically. Recently, some work has demonstrated the excellent capabilities of closed-source Large Language Models (LLMs, e.g., ChatGPT) in grammatical error correction. However, the potential of open-source LLMs remains unexplored. In this paper, we introduced GrammarGPT, an open-source LLM, to preliminary explore its potential for native Chinese grammatical error correction. The core recipe of GrammarGPT is to leverage the hybrid dataset of ChatGPT-generated and human-annotated. For grammatical errors with clues, we proposed a heuristic method to guide ChatGPT to generate ungrammatical sentences by providing those clues. For grammatical errors without clues, we collected ungrammatical sentences from publicly available websites and manually corrected them. In addition, we employed an error-invariant augmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

freedomintelligence/grammargpt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification