TL;DR
LexRAG introduces a new benchmark and toolkit for evaluating retrieval-augmented generation systems specifically in multi-turn legal consultations, addressing a critical gap in legal AI research.
Contribution
This paper presents LexRAG, the first comprehensive benchmark and toolkit for RAG in legal multi-turn conversations, including expert annotations and evaluation pipelines.
Findings
Existing RAG systems struggle with legal multi-turn conversations.
Legal domain-specific retrieval and response generation remain challenging.
LexRAG provides a standardized platform for future research and improvement.
Abstract
Retrieval-augmented generation (RAG) has proven highly effective in improving large language models (LLMs) across various domains. However, there is no benchmark specifically designed to assess the effectiveness of RAG in the legal domain, which restricts progress in this area. To fill this gap, we propose LexRAG, the first benchmark to evaluate RAG systems for multi-turn legal consultations. LexRAG consists of 1,013 multi-turn dialogue samples and 17,228 candidate legal articles. Each sample is annotated by legal experts and consists of five rounds of progressive questioning. LexRAG includes two key tasks: (1) Conversational knowledge retrieval, requiring accurate retrieval of relevant legal articles based on multi-turn context. (2) Response generation, focusing on producing legally sound answers. To ensure reliable reproducibility, we develop LexiT, a legal RAG toolkit that provides a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Byte Pair Encoding · Dense Connections · Linear Warmup With Linear Decay · Linear Layer · BART · Layer Normalization · Attention Dropout · Residual Connection
