NESTLE: a No-Code Tool for Statistical Analysis of Legal Corpus
Kyoungyeon Cho, Seungkum Han, Young Rok Choi, Wonseok Hwang

TL;DR
NESTLE is a no-code tool that leverages LLMs and custom IE systems to enable flexible, large-scale statistical analysis of legal corpora without programming, validated on Korean legal tasks.
Contribution
It introduces a comprehensive no-code platform combining LLMs and custom IE for customizable legal corpus analysis, filling a gap in existing tools.
Findings
Achieves GPT-4 level performance with minimal labeled data.
Successfully applied to Korean legal tasks.
Enables unlimited customizable analysis without coding.
Abstract
The statistical analysis of large scale legal corpus can provide valuable legal insights. For such analysis one needs to (1) select a subset of the corpus using document retrieval tools, (2) structure text using information extraction (IE) systems, and (3) visualize the data for the statistical analysis. Each process demands either specialized tools or programming skills whereas no comprehensive unified "no-code" tools have been available. Here we provide NESTLE, a no-code tool for large-scale statistical analysis of legal corpus. Powered by a Large Language Model (LLM) and the internal custom end-to-end IE system, NESTLE can extract any type of information that has not been predefined in the IE system opening up the possibility of unlimited customizable statistical analysis of the corpus without writing a single line of code. We validate our system on 15 Korean precedent IE tasks and 3…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Natural Language Processing Techniques · Legal Language and Interpretation
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Adam · Byte Pair Encoding · Softmax · Dropout · Label Smoothing · Absolute Position Encodings
