Chinese Labor Law Large Language Model Benchmark

Zixun Lan; Maochun Xu; Yifan Ren; Rui Wu; Jianghui Zhou; Xueyang Cheng; Jianan Ding Ding; Xinheng Wang; Mingmin Chi; Fei Ma

arXiv:2601.09972·cs.AI·January 16, 2026

Chinese Labor Law Large Language Model Benchmark

Zixun Lan, Maochun Xu, Yifan Ren, Rui Wu, Jianghui Zhou, Xueyang Cheng, Jianan Ding Ding, Xinheng Wang, Mingmin Chi, Fei Ma

PDF

Open Access

TL;DR

This paper introduces LabourLawLLM, a specialized Chinese labor law large language model, and LabourLawBench, a comprehensive benchmark for evaluating legal AI performance in labor law tasks, demonstrating superior results over general models.

Contribution

The paper presents a tailored legal LLM for Chinese labor law and a new benchmark, advancing specialized legal AI capabilities and evaluation methods.

Findings

01

LabourLawLLM outperforms general-purpose models in labor law tasks.

02

The benchmark effectively evaluates legal LLMs across diverse tasks.

03

Methodology scalable to other legal subfields.

Abstract

Recent advances in large language models (LLMs) have led to substantial progress in domain-specific applications, particularly within the legal domain. However, general-purpose models such as GPT-4 often struggle with specialized subdomains that require precise legal knowledge, complex reasoning, and contextual sensitivity. To address these limitations, we present LabourLawLLM, a legal large language model tailored to Chinese labor law. We also introduce LabourLawBench, a comprehensive benchmark covering diverse labor-law tasks, including legal provision citation, knowledge-based question answering, case classification, compensation computation, named entity recognition, and legal case analysis. Our evaluation framework combines objective metrics (e.g., ROUGE-L, accuracy, F1, and soft-F1) with subjective assessment based on GPT-4 scoring. Experiments show that LabourLawLLM consistently…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Topic Modeling · Ethics and Social Impacts of AI