Lawyer LLaMA Technical Report

Quzhe Huang; Mingxu Tao; Chen Zhang; Zhenwei An; Cong Jiang; Zhibin; Chen; Zirui Wu; Yansong Feng

arXiv:2305.15062·cs.CL·October 17, 2023·39 cites

Lawyer LLaMA Technical Report

Quzhe Huang, Mingxu Tao, Chen Zhang, Zhenwei An, Cong Jiang, Zhibin, Chen, Zirui Wu, Yansong Feng

PDF

Open Access 1 Repo 2 Models

TL;DR

This paper introduces Lawyer LLaMA, a domain-specific legal language model enhanced with expert knowledge, retrieval mechanisms, and fine-tuning to improve legal reasoning and reduce hallucinations.

Contribution

It presents a novel framework for adapting LLMs to specialized domains, emphasizing expert data over ChatGPT-generated data and integrating retrieval to improve accuracy.

Findings

01

Expert data outperforms ChatGPT data for domain learning

02

Retrieval module reduces hallucinations during generation

03

Lawyer LLaMA achieves improved legal reasoning performance

Abstract

Large Language Models (LLMs), like LLaMA, have exhibited remarkable performance across various tasks. Nevertheless, when deployed to specific domains such as law or medicine, the models still confront the challenge of a deficiency in domain-specific knowledge and an inadequate capability to leverage that knowledge to resolve domain-related problems. In this paper, we propose a new framework to adapt LLMs to specific domains and build Lawyer LLaMA, a legal domain LLM, based on this framework. Specifically, we inject domain knowledge during the continual training stage and teach the model to learn professional skills using properly designed supervised fine-tuning tasks. Moreover, to alleviate the hallucination problem during the model's generation, we add a retrieval module and extract relevant legal articles before the model answers any queries. When learning domain-specific skills, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andrewzhe/lawyer-llama
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Topic Modeling · Natural Language Processing Techniques

MethodsFocus