Automatic Detection of Industry Sectors in Legal Articles Using Machine Learning Approaches
Hui Yang (1, 2), Stella Hadjiantoni (1), Yunfei Long (3), Ruta, Petraityte (2), Berthold Lausen (1, 4) ((1) Department of Mathematical, Sciences, University of Essex, Wivenhoe Park, Colchester, CO43SQ, UK, (2), Mondaq Ltd, Bristol, UK, (3) School of Computer Science, Electronic

TL;DR
This paper presents a machine learning approach combining NLP and statistical techniques to automatically identify industry sectors in legal articles, enabling targeted legal news dissemination and industry analysis.
Contribution
It introduces a novel ML-based system for industry classification in legal texts, comparing traditional ML and deep learning methods on a newly created dataset.
Findings
Achieved AUC scores above 0.90 and F-scores above 0.81.
Traditional ML methods outperform deep neural networks with limited domain-specific data.
System enables scalable and efficient processing of legal articles for industry classification.
Abstract
The ability to automatically identify industry sector coverage in articles on legal developments, or any kind of news articles for that matter, can bring plentiful of benefits both to the readers and the content creators themselves. By having articles tagged based on industry coverage, readers from all around the world would be able to get to legal news that are specific to their region and professional industry. Simultaneously, writers would benefit from understanding which industries potentially lack coverage or which industries readers are currently mostly interested in and thus, they would focus their writing efforts towards more inclusive and relevant legal news coverage. In this paper, a Machine Learning-powered industry analysis approach which combined Natural Language Processing (NLP) with Statistical and Machine Learning (ML) techniques was investigated. A dataset consisting of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Computational and Text Analysis Methods · Law, AI, and Intellectual Property
