Attentive Deep Neural Networks for Legal Document Retrieval
Ha-Thanh Nguyen, Manh-Kien Phi, Xuan-Bach Ngo, Vu Tran, Le-Minh, Nguyen, Minh-Phuong Tu

TL;DR
This paper introduces attentive neural network architectures, including Attentive CNN and Paraformer, to improve legal document retrieval by effectively representing long legal texts across multiple languages, outperforming existing methods.
Contribution
It proposes novel hierarchical neural architectures with sparse attention mechanisms tailored for long legal texts, demonstrating superior retrieval performance over existing models.
Findings
Attentive neural methods outperform non-neural methods in retrieval accuracy.
Pretrained transformers excel on small datasets but are computationally intensive.
Paraformer achieves state-of-the-art results on COLIEE dataset.
Abstract
Legal text retrieval serves as a key component in a wide range of legal text processing tasks such as legal question answering, legal case entailment, and statute law retrieval. The performance of legal text retrieval depends, to a large extent, on the representation of text, both query and legal documents. Based on good representations, a legal text retrieval model can effectively match the query to its relevant documents. Because legal documents often contain long articles and only some parts are relevant to queries, it is quite a challenge for existing models to represent such documents. In this paper, we study the use of attentive neural network-based text representation for statute law document retrieval. We propose a general approach using deep neural networks with attention mechanisms. Based on it, we develop two hierarchical architectures with sparse attention to represent long…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Artificial Intelligence in Law · Natural Language Processing Techniques
