CNN Application in Detection of Privileged Documents in Legal Document Review
Rishi Chhatwal, Robert Keeling, Peter Gronvall, Nathaniel, Huber-Fliflet, Jianping Zhang, Haozhen Zhao

TL;DR
This paper introduces a CNN-based approach combined with keyword searching to improve the accuracy of identifying privileged legal documents, reducing false positives while maintaining high true positive rates.
Contribution
The study proposes a novel CNN method that leverages keyword context and a new training data selection process for privileged document detection.
Findings
Significantly reduces false positives in privileged document detection.
Maintains high true positive capture rates.
Outperforms traditional keyword and text classification methods.
Abstract
Protecting privileged communications and data from disclosure is paramount for legal teams. Legal advice, such as attorney-client communications or litigation strategy are typically exempt from disclosure in litigations or regulatory events and are vital to the attorney-client relationship. To protect this information from disclosure, companies and outside counsel often review vast amounts of documents to determine those that contain privileged material. This process is extremely costly and time consuming. As data volumes increase, legal counsel normally employs methods to reduce the number of documents requiring review while balancing the need to ensure the protection of privileged information. Keyword searching is relied upon as a method to target privileged information and reduce document review populations. Keyword searches are effective at casting a wide net but often return overly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Topic Modeling · Sentiment Analysis and Opinion Mining
