Legal Prompting: Teaching a Language Model to Think Like a Lawyer
Fangyi Yu, Lee Quartey, Frank Schilder

TL;DR
This paper investigates how large language models can be adapted for legal reasoning tasks, demonstrating that prompts based on legal reasoning techniques like IRAC significantly improve performance over general prompting methods.
Contribution
The paper introduces the use of legal reasoning-specific prompts such as IRAC for large language models, achieving state-of-the-art results on the COLIEE entailment task.
Findings
IRAC-based prompts outperform general Chain-of-Thought prompts.
Fine-tuning with explanations improves accuracy.
Achieved new state-of-the-art accuracy of 0.8148.
Abstract
Large language models that are capable of zero or few-shot prompting approaches have given rise to the new research area of prompt engineering. Recent advances showed that for example Chain-of-Thought (CoT) prompts can improve arithmetic or common sense tasks significantly. We explore how such approaches fare with legal reasoning tasks and take the COLIEE entailment task based on the Japanese Bar exam for testing zero-shot/few-shot and fine-tuning approaches. Our findings show that while CoT prompting and fine-tuning with explanations approaches show improvements, the best results are produced by prompts that are derived from specific legal reasoning techniques such as IRAC (Issue, Rule, Application, Conclusion). Based on our experiments we improve the 2021 best result from 0.7037 accuracy to 0.8148 accuracy and beat the 2022 best system of 0.6789 accuracy with an accuracy of 0.7431.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Topic Modeling · Legal Education and Practice Innovations
MethodsChain-of-thought prompting
