ComplianceNLP: Knowledge-Graph-Augmented RAG for Multi-Framework Regulatory Gap Detection
Dongxin Guo, Jikun Wu, and Siu Ming Yiu

TL;DR
ComplianceNLP is an automated system that leverages knowledge graphs and multi-task learning to detect regulatory compliance gaps, significantly reducing manual effort and improving accuracy in financial regulation monitoring.
Contribution
The paper introduces a novel knowledge-graph-augmented RAG pipeline combined with multi-task obligation extraction, achieving state-of-the-art performance in regulatory gap detection.
Findings
87.7 F1 on gap detection, outperforming GPT-4o+RAG by +3.5 F1
94.2% grounding accuracy in regulatory references
96.0% recall and 90.7% precision in real-world deployment
Abstract
Financial institutions must track over 60,000 regulatory events annually, overwhelming manual compliance teams; the industry has paid over USD 300 billion in fines and settlements since the 2008 financial crisis. We present ComplianceNLP, an end-to-end system that automatically monitors regulatory changes, extracts structured obligations, and identifies compliance gaps against institutional policies. The system integrates three components: (1) a knowledge-graph-augmented RAG pipeline grounding generations in a regulatory knowledge graph of 12,847 provisions across SEC, MiFID II, and Basel III; (2) multi-task obligation extraction combining NER, deontic classification, and cross-reference resolution over a shared LEGAL-BERT encoder; and (3) compliance gap analysis that maps obligations to internal policies with severity-aware scoring. On our benchmark, ComplianceNLP achieves 87.7 F1 on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
