Regulatory Compliance through Doc2Doc Information Retrieval: A case study in EU/UK legislation where text similarity has limitations
Ilias Chalkidis, Manos Fergadiotis, Nikolaos Manginas, Eva Katakalou, and Prodromos Malakasiotis

TL;DR
This paper explores document-to-document information retrieval for regulatory compliance, highlighting challenges with text similarity, and demonstrates the effectiveness of BERT-based models and temporal filtering in EU/UK legislative datasets.
Contribution
It introduces REG-IR, a novel document retrieval approach for legal texts, and provides new datasets and insights into model performance and limitations.
Findings
Fine-tuned BERT models yield the best IR representations.
Neural re-rankers underperform due to conflicting supervision.
Applying date filters improves retrieval performance.
Abstract
Major scandals in corporate history have urged the need for regulatory compliance, where organizations need to ensure that their controls (processes) comply with relevant laws, regulations, and policies. However, keeping track of the constantly changing legislation is difficult, thus organizations are increasingly adopting Regulatory Technology (RegTech) to facilitate the process. To this end, we introduce regulatory information retrieval (REG-IR), an application of document-to-document information retrieval (DOC2DOC IR), where the query is an entire document making the task more challenging than traditional IR where the queries are short. Furthermore, we compile and release two datasets based on the relationships between EU directives and UK legislation. We experiment on these datasets using a typical two-step pipeline approach comprising a pre-fetcher and a neural re-ranker.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Layer Normalization · Refunds@Expedia|||How do I get a full refund from Expedia? · Residual Connection · WordPiece · Attention Dropout · Attention Is All You Need · Dense Connections · Adam · Linear Warmup With Linear Decay
