Linguistic Legal Concept Extraction in Portuguese
Alessandra Cid, Alexandre Rademaker, Bruno Cuconato, Valeria, de Paiva

TL;DR
This paper explores legal concept extraction in Portuguese, focusing on the OAB exam, by enriching the OpenWordNet-PT lexical database with relevant legal terms and concepts.
Contribution
It introduces a method to identify and incorporate legal concepts from exam texts into OpenWordNet-PT, enhancing its coverage for legal language processing.
Findings
Expanded OpenWordNet-PT with new legal concepts
Improved lexical coverage for Portuguese legal language
Enhanced resources for legal NLP applications
Abstract
This work investigates legal concepts and their expression in Portuguese, concentrating on the "Order of Attorneys of Brazil" Bar exam. Using a corpus formed by a collection of multiple-choice questions, three norms related to the Ethics part of the OAB exam, language resources (Princeton WordNet and OpenWordNet-PT) and tools (AntConc and Freeling), we began to investigate the concepts and words missing from our repertory of concepts and words in Portuguese, the knowledge base OpenWordNet-PT. We add these concepts and words to OpenWordNet-PT and hence obtain a representation of these texts that is "contained" in the lexical knowledge base.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Natural Language Processing Techniques · Legal Language and Interpretation
