CLASE: A Hybrid Method for Chinese Legalese Stylistic Evaluation
Yiran Rex Ma, Yuxiao Ye, Huiyuan Xie

TL;DR
CLASE is a hybrid evaluation method combining linguistic features and LLM judgments to assess Chinese legal text style, outperforming traditional metrics and providing interpretable feedback.
Contribution
Introduces CLASE, a novel hybrid, reference-free evaluation approach for legal stylistic quality that integrates feature-based scores with LLM judgments, learned from contrastive legal document pairs.
Findings
Higher alignment with human judgments than traditional metrics
Provides interpretable scores and improvement suggestions
Scalable and practical for legal text style evaluation
Abstract
Legal text generated by large language models (LLMs) can usually achieve reasonable factual accuracy, but it frequently fails to adhere to the specialised stylistic norms and linguistic conventions of legal writing. In order to improve stylistic quality, a crucial first step is to establish a reliable evaluation method. However, having legal experts manually develop such a metric is impractical, as the implicit stylistic requirements in legal writing practice are difficult to formalise into explicit rubrics. Meanwhile, existing automatic evaluation methods also fall short: reference-based metrics conflate semantic accuracy with stylistic fidelity, and LLM-as-a-judge evaluations suffer from opacity and inconsistency. To address these challenges, we introduce CLASE (Chinese LegAlese Stylistic Evaluation), a hybrid evaluation method that focuses on the stylistic performance of legal text.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Law · Text Readability and Simplification
