PatentEval: Understanding Errors in Patent Generation

You Zuo (ALMAnaCH); Kim Gerdes (LISN); Eric Villemonte de La Clergerie (ALMAnaCH); Benoît Sagot (ALMAnaCH)

arXiv:2406.06589 · cs.CL · June 26, 2024

TL;DR

This paper introduces a detailed error typology and a benchmark called PatentEval for assessing machine-generated patent texts, comparing various models and evaluating metrics against human judgments.

Contribution

It presents a new error typology and benchmark for evaluating patent text generation, including a comparative analysis of models and assessment metrics.

Findings

01 Human-annotated analysis of model performance

02 Evaluation of metrics against expert judgments

03 Insights into language models' capabilities in patent generation

Abstract

In this work, we introduce a comprehensive error typology specifically designed for evaluating two distinct tasks in machine-generated patent texts: claims-to-abstract generation, and the generation of the next claim given previous ones. We have also developed a benchmark, PatentEval, for systematically assessing language models in this context. Our study includes a comparative analysis, annotated by humans, of various models. These range from those specifically adapted during training for tasks within the patent domain to the latest general-purpose large language models (LLMs). Furthermore, we explored and evaluated some metrics to approximate human judgments in patent text evaluation, analyzing the extent to which these metrics align with expert assessments. These approaches provide valuable insights into the capabilities and limitations of current language models in the specialized…
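
To make the idea of checking how well a metric aligns with expert assessments concrete, here is a minimal sketch. It is not the authors' procedure: the abstract does not say which agreement statistic was used, so Kendall's tau-a is an assumption chosen for illustration, and the function name `kendall_tau_a` and all scores below are hypothetical toy values.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """Kendall's tau-a: (concordant - discordant) / number of pairs.

    Illustrative statistic only; the paper's actual agreement measure
    is not stated in the abstract.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = 0
    discordant = 0
    for i, j in combinations(range(len(xs)), 2):
        # A pair is concordant when both rankings order it the same way.
        product = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
        # Tied pairs count toward neither total.
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Made-up scores for five hypothetical generated abstracts.
human_scores = [1, 2, 3, 4, 5]                    # expert ratings (toy data)
metric_scores = [0.10, 0.30, 0.20, 0.80, 0.90]    # automatic metric (toy data)

tau = kendall_tau_a(human_scores, metric_scores)
print(f"Kendall tau-a between metric and human ratings: {tau:.2f}")
```

A value near 1 would mean the metric ranks outputs much as the experts do, a value near 0 would mean little relation, and a negative value would mean the metric tends to reverse the expert ordering.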

Code & Models

Repositories

zoeyou/patenteval (PyTorch, official)

Videos

PatentEval: Understanding Errors in Patent Generation · Underline

Taxonomy

Topics: Intellectual Property and Patents

Methods: ALIGN