Natural Language Processing in the Legal Domain
Dirk Hartung, Daniel Martin Katz, Michael J. Bommarito, Lauritz Gerlach, Abhik Jana, and Jerrold Soh

TL;DR
This paper reviews the evolution of NLP applied to legal texts, analyzing nearly a thousand papers from 2013 to 2024, highlighting trends in research activity, methods, and data practices.
Contribution
It provides a comprehensive analysis of the field's development, emphasizing increased research volume, methodological sophistication, and improved data and code transparency.
Findings
Growth in number of legal NLP papers over a decade
Increase in methodological complexity and sophistication
Improved data availability and code reproducibility
Abstract
We summarize the current state of the field of NLP & Law with a specific focus on recent technical and substantive developments. To support our analysis, we construct and analyze a nearly complete corpus of nearly one thousand NLP & Law related papers published between 2013-2024. Our analysis highlights several major trends. Namely, we document an increasing number of papers written, tasks undertaken, and languages covered over the course of the past decade. We observe an increase in the sophistication of the methods which researchers deployed in this applied context. Legal NLP is beginning to match not only the methodological sophistication of general NLP but also the professional standards of data availability and code reproducibility observed within the broader scientific community. We believe all of these trends bode well for the future of the field and point to an exciting next…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
