TL;DR
This paper systematically evaluates lightweight large language models for court view generation and charge prediction, revealing their potential in judicial AI with a new evaluation framework.
Contribution
It introduces CVGEvalKit, a comprehensive evaluation framework and datasets for assessing lightweight LLMs in legal AI tasks, and provides insights into their performance trade-offs.
Findings
Lightweight LLMs can effectively generate court views and predict charges.
Model architecture and size significantly influence performance.
Lightweight LLMs outperform DNNs in these legal tasks.
Abstract
Criminal Court View Generation (CVG) is a critical task in Legal Artificial Intelligence (Legal AI), involving the generation of court view based on case facts. In this work, we systematically explore the capabilities of lightweight (smaller than 2B) large language models (LLMs) in CVG and their impact on charge prediction. Our study addresses four key questions: (1) how does different architecture of LLMs affect the CVG quality and charge prediction. (2) how does LLMs size contribute to the performance, (3) how do lightweight LLMs compare with Deep Neural Networks (DNNs) in these tasks, and (4) how does predicting charge by court view generation first compare with predicting it directly. Additionally, we also develop CVGEvalKit, an evaluation framework including three public available datasets for CVG tasks, as well as predicting their charges. Comprehensive experiments are conducted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
