An Empirical Study on Transfer Learning for Privilege Review
Haozhen Zhao, Shi Ye, Jingchao Yang

TL;DR
This study evaluates the effectiveness of transfer learning, especially BERT, in identifying privileged legal documents, demonstrating improved performance over traditional methods across multiple datasets.
Contribution
It provides empirical evidence that transfer learning with BERT enhances privilege document classification in legal review, outperforming standard logistic regression models.
Findings
BERT outperforms logistic regression in privilege classification.
Transfer learning achieves decent results on similar domain datasets.
Models perform well with limited new training data.
Abstract
Protecting privileged communications and data from inadvertent disclosure is a paramount task in the US legal practice. Traditionally counsels rely on keyword searching and manual review to identify privileged documents in cases. As data volumes increase, this approach becomes less and less defensible in costs. Machine learning methods have been used in identifying privilege documents. Given the generalizable nature of privilege in legal cases, we hypothesize that transfer learning can capitalize knowledge learned from existing labeled data to identify privilege documents without requiring labeling new training data. In this paper, we study both traditional machine learning models and deep learning models based on BERT for privilege document classification tasks in legal document review, and we examine the effectiveness of transfer learning in privilege model on three real world…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLegal Education and Practice Innovations · Artificial Intelligence in Law · Law, Economics, and Judicial Systems
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Softmax · Logistic Regression · WordPiece · Adam
