Classification of US Supreme Court Cases using BERT-Based Techniques
Shubham Vatsal, Adam Meyers, and John E. Ortega

TL;DR
This paper explores BERT-based models for classifying US Supreme Court decisions, achieving improved accuracy over previous state-of-the-art results on both broad and fine-grained categorization tasks.
Contribution
It introduces BERT-based classification techniques tailored for long legal documents and demonstrates significant accuracy improvements over prior models.
Findings
Achieved 80% accuracy on 15 broad categories
Achieved 60% accuracy on 279 fine-grained categories
Outperformed previous SOTA results by 8% and 28% respectively
Abstract
Models based on bidirectional encoder representations from transformers (BERT) produce state of the art (SOTA) results on many natural language processing (NLP) tasks such as named entity recognition (NER), part-of-speech (POS) tagging etc. An interesting phenomenon occurs when classifying long documents such as those from the US supreme court where BERT-based models can be considered difficult to use on a first-pass or out-of-the-box basis. In this paper, we experiment with several BERT-based classification techniques for US supreme court decisions or supreme court database (SCDB) and compare them with the previous SOTA results. We then compare our results specifically with SOTA models for long documents. We compare our results for two classification tasks: (1) a broad classification task with 15 categories and (2) a fine-grained classification task with 279 categories. Our best result…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Judicial and Constitutional Studies · Legal Education and Practice Innovations
