Lawyer LLaMA Technical Report
Quzhe Huang, Mingxu Tao, Chen Zhang, Zhenwei An, Cong Jiang, Zhibin, Chen, Zirui Wu, Yansong Feng

TL;DR
This paper introduces Lawyer LLaMA, a domain-specific legal language model enhanced with expert knowledge, retrieval mechanisms, and fine-tuning to improve legal reasoning and reduce hallucinations.
Contribution
It presents a novel framework for adapting LLMs to specialized domains, emphasizing expert data over ChatGPT-generated data and integrating retrieval to improve accuracy.
Findings
Expert data outperforms ChatGPT data for domain learning
Retrieval module reduces hallucinations during generation
Lawyer LLaMA achieves improved legal reasoning performance
Abstract
Large Language Models (LLMs), like LLaMA, have exhibited remarkable performance across various tasks. Nevertheless, when deployed to specific domains such as law or medicine, the models still confront the challenge of a deficiency in domain-specific knowledge and an inadequate capability to leverage that knowledge to resolve domain-related problems. In this paper, we propose a new framework to adapt LLMs to specific domains and build Lawyer LLaMA, a legal domain LLM, based on this framework. Specifically, we inject domain knowledge during the continual training stage and teach the model to learn professional skills using properly designed supervised fine-tuning tasks. Moreover, to alleviate the hallucination problem during the model's generation, we add a retrieval module and extract relevant legal articles before the model answers any queries. When learning domain-specific skills, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Topic Modeling · Natural Language Processing Techniques
MethodsFocus
