Question-Answering Approach to Evaluating Legal Summaries

Huihui Xu; Kevin Ashley

arXiv:2309.15016·cs.CL·December 20, 2023·1 cites

Question-Answering Approach to Evaluating Legal Summaries

Huihui Xu, Kevin Ashley

PDF

Open Access 1 Repo

TL;DR

This paper introduces a GPT-4 based question-answering framework for evaluating legal summaries, focusing on argumentative structure rather than lexical overlap, and shows promising correlation with human judgments.

Contribution

It presents a novel GPT-4 driven evaluation method for legal summaries that considers argumentative content, improving upon traditional lexical overlap metrics.

Findings

01

GPT-4-based evaluation correlates well with human grading

02

The method effectively captures legal summary quality

03

It offers a new approach for legal summarization assessment

Abstract

Traditional evaluation metrics like ROUGE compare lexical overlap between the reference and generated summaries without taking argumentative structure into account, which is important for legal summaries. In this paper, we propose a novel legal summarization evaluation framework that utilizes GPT-4 to generate a set of question-answer pairs that cover main points and information in the reference summary. GPT-4 is then used to generate answers based on the generated summary for the questions from the reference summary. Finally, GPT-4 grades the answers from the reference summary and the generated summary. We examined the correlation between GPT-4 grading with human grading. The results suggest that this question-answering approach with GPT-4 can be a useful tool for gauging the quality of the summary.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

joycexu02/qa_evaluation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Artificial Intelligence in Law

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Adam · Layer Normalization · Label Smoothing · Byte Pair Encoding · Absolute Position Encodings · Dense Connections