Legal Case Document Similarity: You Need Both Network and Text
Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh

TL;DR
This paper introduces Hier-SPCNet, a novel approach combining network and text analysis to improve legal case document similarity estimation, validated through extensive experiments with Indian judiciary data.
Contribution
Proposes Hier-SPCNet, integrating statutes into network models and unifying network and text signals for enhanced legal document similarity measurement.
Findings
Significantly improves correlation with legal experts' opinions.
Outperforms existing network-based and text-based methods.
Effective in recommending similar and citable cases.
Abstract
Estimating the similarity between two legal case documents is an important and challenging problem, having various downstream applications such as prior-case retrieval and citation recommendation. There are two broad approaches for the task -- citation network-based and text-based. Prior citation network-based approaches consider citations only to prior-cases (also called precedents) (PCNet). This approach misses important signals inherent in Statutes (written laws of a jurisdiction). In this work, we propose Hier-SPCNet that augments PCNet with a heterogeneous network of Statutes. We incorporate domain knowledge for legal document similarity into Hier-SPCNet, thereby obtaining state-of-the-art results for network-based legal document similarity. Both textual and network similarity provide important signals for legal case similarity; but till now, only trivial attempts have been made to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Legal Education and Practice Innovations · Topic Modeling
