Can Large Language Models Generate High-quality Patent Claims?
Lekang Jiang, Caiqi Zhang, Pascal A Scherz, Stephan Goetz

TL;DR
This study evaluates the ability of large language models to generate high-quality patent claims, revealing strengths in initial claim creation and highlighting the need for domain-specific models and further refinement.
Contribution
It constructs a new dataset for patent claim generation and compares general and patent-specific LLMs, showing GPT-4's superior performance and identifying areas for improvement.
Findings
GPT-4 outperforms other LLMs in patent claim quality
Current patent-specific LLMs underperform compared to general models
Fine-tuning improves claim completeness and clarity
Abstract
Large language models (LLMs) have shown exceptional performance across various text generation tasks but remain under-explored in the patent domain, which offers highly structured and precise language. This paper constructs a dataset to investigate the performance of current LLMs in patent claim generation. Our results demonstrate that generating claims based on patent descriptions outperforms previous research relying on abstracts. Interestingly, current patent-specific LLMs perform much worse than state-of-the-art general LLMs, highlighting the necessity for future research on in-domain LLMs. We also find that LLMs can produce high-quality first independent claims, but their performances markedly decrease for subsequent dependent claims. Moreover, fine-tuning can enhance the completeness of inventions' features, conceptual clarity, and feature linkage. Among the tested LLMs, GPT-4…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsIntellectual Property and Patents
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Adam · Dense Connections
