InternLM-Law: An Open Source Chinese Legal Large Language Model

Zhiwei Fei; Songyang Zhang; Xiaoyu Shen; Dawei Zhu; Xiao Wang; Maosong; Cao; Fengzhe Zhou; Yining Li; Wenwei Zhang; Dahua Lin; Kai Chen; Jidong Ge

arXiv:2406.14887·cs.CL·June 24, 2024·2 cites

InternLM-Law: An Open Source Chinese Legal Large Language Model

Zhiwei Fei, Songyang Zhang, Xiaoyu Shen, Dawei Zhu, Xiao Wang, Maosong, Cao, Fengzhe Zhou, Yining Li, Wenwei Zhang, Dahua Lin, Kai Chen, Jidong Ge

PDF

Open Access 1 Repo

TL;DR

InternLM-Law is a specialized Chinese legal large language model trained on a large, high-quality legal dataset, achieving top performance on legal benchmarks and aiding future legal AI research.

Contribution

The paper introduces InternLM-Law, a novel two-stage fine-tuning approach and a large legal dataset, advancing Chinese legal LLM capabilities beyond existing models.

Findings

01

Achieves highest average performance on LawBench

02

Outperforms GPT-4 on 13 out of 20 subtasks

03

Provides publicly available legal LLM and dataset

Abstract

While large language models (LLMs) have showcased impressive capabilities, they struggle with addressing legal queries due to the intricate complexities and specialized expertise required in the legal field. In this paper, we introduce InternLM-Law, a specialized LLM tailored for addressing diverse legal queries related to Chinese laws, spanning from responding to standard legal questions (e.g., legal exercises in textbooks) to analyzing complex real-world legal situations. We meticulously construct a dataset in the Chinese legal domain, encompassing over 1 million queries, and implement a data filtering and processing pipeline to ensure its diversity and quality. Our training approach involves a novel two-stage process: initially fine-tuning LLMs on both legal-specific and general-purpose content to equip the models with broad knowledge, followed by exclusive fine-tuning on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

internlm/internlm-law
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law

MethodsAttention Is All You Need · Softmax · Layer Normalization · Absolute Position Encodings · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Linear Layer