ILSIC: Corpora for Identifying Indian Legal Statutes from Queries by Laypeople
Shounak Paul, Raghav Dogra, Pawan Goyal, Saptarshi Ghosh

TL;DR
This paper introduces ILSIC, a new corpus of Indian legal queries from laypeople, enabling better understanding and modeling of informal legal questions distinct from court judgments.
Contribution
The creation of ILSIC, a comprehensive dataset of laypeople legal queries and court judgments, and extensive benchmarking to analyze differences and transfer learning potential.
Findings
Models trained on court judgments perform poorly on laypeople queries.
Transfer learning can improve performance in certain scenarios.
Fine-grained analysis reveals variations across query categories and statute frequencies.
Abstract
Legal Statute Identification (LSI) for a given situation is one of the most fundamental tasks in Legal NLP. This task has traditionally been modeled using facts from court judgments as input queries, due to their abundance. However, in practical settings, the input queries are likely to be informal and asked by laypersons, or non-professionals. While a few laypeople LSI datasets exist, there has been little research to explore the differences between court and laypeople data for LSI. In this work, we create ILSIC, a corpus of laypeople queries covering 500+ statutes from Indian law. Additionally, the corpus also contains court case judgements to enable researchers to effectively compare between court and laypeople data for LSI. We conducted extensive experiments on our corpus, including benchmarking over the laypeople dataset using zero and few-shot inference, retrieval-augmented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Topic Modeling · Multi-Agent Systems and Negotiation
