Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
Venkatesh Mishra, Bimsara Pathiraja, Mihir Parmar, Sat Chidananda,, Jayanth Srinivasa, Gaowen Liu, Ali Payani, Chitta Baral

TL;DR
This paper analyzes the reasoning errors of large language models in legal reasoning tasks, proposing a new error taxonomy and an automated evaluation framework to identify specific reasoning shortcomings and improve model performance.
Contribution
It introduces a detailed error taxonomy for legal reasoning in LLMs and develops an automated framework for step-by-step error analysis, advancing beyond overall accuracy metrics.
Findings
LLMs exhibit specific reasoning errors in legal tasks
The auto-evaluator framework effectively identifies errors in reasoning chains
Incorporating error feedback marginally improves LLM performance
Abstract
Reasoning abilities of LLMs have been a key focus in recent years. One challenging reasoning domain with interesting nuances is legal reasoning, which requires careful application of rules, and precedents while balancing deductive and analogical reasoning, and conflicts between rules. Although there have been a few works on using LLMs for legal reasoning, their focus has been on overall accuracy. In this paper, we dig deeper to do a step-by-step analysis and figure out where they commit errors. We use the college-level Multiple Choice Question-Answering (MCQA) task from the \textit{Civil Procedure} dataset and propose a new error taxonomy derived from initial manual analysis of reasoning chains with respect to several LLMs, including two objective measures: soundness and correctness scores. We then develop an LLM-based automated evaluation framework to identify reasoning errors and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsArtificial Intelligence in Law · Law, AI, and Intellectual Property · Legal Education and Practice Innovations
MethodsFocus
