CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case   Encoding

Yixiao Ma; Yueyue Wu; Weihang Su; Qingyao Ai; Yiqun Liu

arXiv:2305.05393·cs.IR·May 10, 2023·1 cites

CaseEncoder: A Knowledge-enhanced Pre-trained Model for Legal Case Encoding

Yixiao Ma, Yueyue Wu, Weihang Su, Qingyao Ai, Yiqun Liu

PDF

Open Access

TL;DR

CaseEncoder is a novel legal case encoding model that incorporates fine-grained legal knowledge during data sampling and pre-training, significantly improving zero-shot legal case retrieval performance.

Contribution

It introduces a legal-specific pre-training framework with knowledge-guided data sampling and a Biased Circle Loss, advancing legal document encoding methods.

Findings

01

Outperforms existing models on multiple benchmarks

02

Enhances zero-shot legal case retrieval accuracy

03

Utilizes legal knowledge to improve relevance recognition

Abstract

Legal case retrieval is a critical process for modern legal information systems. While recent studies have utilized pre-trained language models (PLMs) based on the general domain self-supervised pre-training paradigm to build models for legal case retrieval, there are limitations in using general domain PLMs as backbones. Specifically, these models may not fully capture the underlying legal features in legal case documents. To address this issue, we propose CaseEncoder, a legal document encoder that leverages fine-grained legal knowledge in both the data sampling and pre-training phases. In the data sampling phase, we enhance the quality of the training data by utilizing fine-grained law article information to guide the selection of positive and negative examples. In the pre-training phase, we design legal-specific pre-training tasks that align with the judging criteria of relevant…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Legal Education and Practice Innovations · Comparative and International Law Studies

MethodsALIGN