LLMCL-GEC: Advancing Grammatical Error Correction with LLM-Driven   Curriculum Learning

Tao Fang; Derek F. Wong; Lusheng Zhang; Keyan Jin; Qiang Zhang,; Tianjiao Li; Jinlong Hou; Lidia S. Chao

arXiv:2412.12541·cs.CL·December 18, 2024

LLMCL-GEC: Advancing Grammatical Error Correction with LLM-Driven Curriculum Learning

Tao Fang, Derek F. Wong, Lusheng Zhang, Keyan Jin, Qiang Zhang,, Tianjiao Li, Jinlong Hou, Lidia S. Chao

PDF

Open Access

TL;DR

This paper introduces LLMCL-GEC, a novel curriculum learning approach that leverages large language models to improve grammatical error correction by mimicking human-like curriculum strategies, resulting in significant performance gains.

Contribution

It proposes a new LLM-based curriculum learning method for GEC, utilizing LLMs to assess data complexity and enhance training effectiveness over traditional methods.

Findings

01

Significant performance improvements on GEC benchmarks.

02

Effective data complexity assessment using LLMs.

03

Outperforms baseline and traditional curriculum learning methods.

Abstract

While large-scale language models (LLMs) have demonstrated remarkable capabilities in specific natural language processing (NLP) tasks, they may still lack proficiency compared to specialized models in certain domains, such as grammatical error correction (GEC). Drawing inspiration from the concept of curriculum learning, we have delved into refining LLMs into proficient GEC experts by devising effective curriculum learning (CL) strategies. In this paper, we introduce a novel approach, termed LLM-based curriculum learning, which capitalizes on the robust semantic comprehension and discriminative prowess inherent in LLMs to gauge the complexity of GEC training data. Unlike traditional curriculum learning techniques, our method closely mirrors human expert-designed curriculums. Leveraging the proposed LLM-based CL method, we sequentially select varying levels of curriculums ranging from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Inverse Square Root Schedule · Layer Normalization · Residual Connection · Gated Linear Unit · SentencePiece