Interpretable Long-Form Legal Question Answering with   Retrieval-Augmented Large Language Models

Antoine Louis; Gijs van Dijck; Gerasimos Spanakis

arXiv:2309.17050·cs.CL·October 2, 2023·2 cites

Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models

Antoine Louis, Gijs van Dijck, Gerasimos Spanakis

PDF

Open Access 1 Repo 4 Models 1 Datasets 1 Video

TL;DR

This paper introduces a novel end-to-end approach for long-form legal question answering using retrieval-augmented large language models, supported by a new expert-annotated dataset in French, aiming to improve legal literacy and model evaluation.

Contribution

It presents a retrieve-then-read pipeline for long-form legal answers and releases the LLeQA dataset, a comprehensive resource for French legal questions and answers.

Findings

01

Promising performance on automatic evaluation metrics.

02

Qualitative analysis highlights areas for model refinement.

03

LLeQA dataset enables rigorous NLP model evaluation in legal domain.

Abstract

Many individuals are likely to face a legal dispute at some point in their lives, but their lack of understanding of how to navigate these complex issues often renders them vulnerable. The advancement of natural language processing opens new avenues for bridging this legal literacy gap through the development of automated legal aid systems. However, existing legal question answering (LQA) approaches often suffer from a narrow scope, being either confined to specific legal domains or limited to brief, uninformative responses. In this work, we propose an end-to-end methodology designed to generate long-form answers to any statutory law questions, utilizing a "retrieve-then-read" pipeline. To support this approach, we introduce and release the Long-form Legal Question Answering (LLeQA) dataset, comprising 1,868 expert-annotated legal questions in the French language, complete with detailed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maastrichtlawtech/lleqa
pytorchOfficial

Models

Datasets

maastrichtlawtech/lleqa
dataset· 28 dl
28 dl

Videos

Interpretable Long-Form Legal Question Answering with Retrieval-Augmented Large Language Models· underline

Taxonomy

TopicsArtificial Intelligence in Law · Topic Modeling · Natural Language Processing Techniques