InstructPatentGPT: Training patent language models to follow   instructions with human feedback

Jieh-Sheng Lee

arXiv:2406.16897·cs.CL·June 26, 2024

InstructPatentGPT: Training patent language models to follow instructions with human feedback

Jieh-Sheng Lee

PDF

TL;DR

This paper presents InstructPatentGPT, a patent language model trained with human feedback to improve patent claim generation, demonstrating controllability and potential for broader application with limited hardware.

Contribution

It introduces a reinforcement learning approach from human feedback for patent language models, focusing on claim quality and controllability, with a practical implementation on consumer hardware.

Findings

01

Model can learn from granted and pre-grant patents

02

Adjusts claim length and scope based on feedback

03

Operates efficiently on a single GPU

Abstract

In this research, patent prosecution is conceptualized as a system of reinforcement learning from human feedback. The objective of the system is to increase the likelihood for a language model to generate patent claims that have a higher chance of being granted. To showcase the controllability of the language model, the system learns from granted patents and pre-grant applications with different rewards. The status of "granted" and "pre-grant" are perceived as labeled human feedback implicitly. In addition, specific to patent drafting, the experiments in this research demonstrate the model's capability to learn from adjusting claim length and inclusion of limiting terms for narrowing claim scope. As proof of concept, the experiments focus on claim ones only and the training data originates from a patent dataset tailored specifically for artificial intelligence. Although the available…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.